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Preface 


This book is about the calculus of variations which is a subject concerned mainly with optimization 
of functionals. However, because part of it is based on using ordinary calculus in solving optimization 
problems, “Calculus of Variations” in its original title is modified to become “Mathematics of Variation”. 
In fact, the book is essentially a collection of solved problems with rather modest theoretical background 
and hence it is based on the method of “learning by example and practice” which in our view is the most 
effective way for learning mathematics and overcoming the difficulties of its abstraction. The main merit 
of the book is its clarity, intuitive structure and rather inclusiveness as it includes the main topics and 
applications of this subject. The structure of the book is that it starts with a preliminary chapter which 
provides a general theoretical background about this subject with many solved problems related to this 
background. In the remaining chapters we present a number of common topics and applications related to 
the mathematical methods of variation. So, the remaining chapters consist essentially of solved problems 
classified according to certain mathematical and physical criteria with some introductory and general 
background. 

Because the book is about the mathematical methods of variation, we do not explain things like inte- 
gration or partial differentiation or how to obtain solutions of differential equations although we usually 
make short explanatory remarks or put the results in a format that is easy to understand and verify. 
We also avoid going through many theoretical details and technicalities of the calculus of variations to 
avoid unnecessary distraction from our main practical objectives and to save space. For example, we 
do not go through the derivation of the formulae of the calculus of variations because such details can 
be found in almost every book on this subject. Similarly, we avoid going through technicalities related 
for example to the nature of the arguments of real-valued functions (such as the square root or natural 
logarithm) and how and when these arguments should be non-negative. So, in brief we take things rather 
easy in presenting the subject using general understanding and common sense in dealing with the varia- 
tional problems. Accordingly, the materials in this book have essentially practical objectives rather than 
pedantic mathematical purposes. So, the target of this book is essentially scientists, engineers and applied 
mathematicians rather than mathematicians. 

The materials in this book require decent background in general mathematics (mostly in single-variable 
and multi-variable differential and integral calculus). Some problems and applications also require rea- 
sonable background in physics which is the main field of application of the calculus (and mathematics) of 
variations. The book can be used as a text or as a reference for an introductory course on this subject 
as part of an undergraduate curriculum in physics or engineering or applied mathematics. The book can 
also be used as a source of supplementary pedagogical materials used in tutorial sessions associated with 
such a course. 

Taha Sochi 
London, October 2020 
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Nomenclature 


In the following list, we define the common symbols, notations and abbreviations which are used in the 


book as a quick reference for the reader. The list may exclude what is used locally and casually. 
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nabla, differential operator 

gradient of scalar f 

total derivative (usually with respect to coordinate z, i.e. d/dx) 
total derivative (usually with respect to time ft, i.e. d/dt) 
boundary of domain 2 

lengths of semi-major and semi-minor axes of ellipse 

lengths of semi-axes of ellipsoid 

the speed of light in vacuum 

parameters in 1D Rayleigh-Ritz method 

parameters in 2D Rayleigh-Ritz method 

the speed of light in material media 

constants 
constants 
infinitesimal line element of curve 

eccentricity of ellipse 

Equation, Equations 

function 

integrand of functional integral 

magnitude of gravitational acceleration or gravitational force field 

constraints (in Lagrange multipliers technique) 

integrands of constraints (in Lagrange multipliers technique) 

function to be optimized with constraint (in Lagrange multipliers technique) 
height 

integrand of constrained variational functional (in Lagrange multipliers technique) 
moment of inertia 

functional integral (optimized or stationarized by y) 

length 

Lagrangian (in Hamiltonian mechanics) 

length, width 

mass 

reduced mass 

refractive index of light in material media 

perimeter 

radius 

position vector 

position vector of the center of mass 

resistance to fluid flow 

the set of real numbers 

spherical coordinates of 3D Euclidean space 

length of curve 

surface 

sum (representing discrete form of functional integral) stationarized by y1,--+ , Yn 
time 

kinetic energy 

rotational energy, translational energy 

potential energy 


v speed 


Vw speed of wave 
v velocity 
V volume 
L,Y, 2 coordinates of 3D Euclidean space (usually orthonormal Cartesian) 
Yi the i*” 1D Rayleigh-Ritz approximation 
Y1s°°* Yn y values of function points in 1D finite difference method!" 
yO the i*” derivative of y (i.e. yO = diy/dz*) 
Vii partial derivative of y; with respect to a; (i.e. yj; = Oy; /02;) 
Yar partial derivative of y with respect to x, (i.e. yz, = Oy/O21) 
Sing the mn” 2D Rayleigh-Ritz approximation 
Z11,°°* 52mn z values of function points in 2D finite difference method 
T curve 
A, A1,°:+;An Lagrange multipliers, eigenvalues 
Am, AM minimum, maximum eigenvalue of Sturm-Liouville problem 
Lb linear mass density 
p,@ polar coordinates of plane 
p, 0, Z cylindrical coordinates of 3D Euclidean space 
a area 
7 tension force 
go,°+* » On basis functions in 1D Rayleigh-Ritz method?! 
® gravitational potential 
W angular speed 
Q set of functions, domain of multi-variate function 
1 In fact, these y values (as well as the upcoming z values z11,--- , mn) are supposed to be approximates of the extremizing 
function at these points. 
[2lIn fact, we should also have ¢0,¢11--: ,¢mn basis functions for the 2D Rayleigh-Ritz method but we did not use this 


notation in this book. 


Chapter 1 
Preliminaries 


In this chapter we provide an outline of the calculus of variations and its basic principles. In the subse- 
quent chapters we will investigate various categories of variational problems using different methods and 
approaches which are mostly based on what is outlined in this chapter. However, before we start we make 
a few introductory remarks of general nature. 


1.1 Introductory Remarks 


In this section we present a number of general remarks related to conventions, notations and commonly 
occurring issues in this book. 

e As indicated in the Preface, we have no primary interest in the theory of the calculus of variations (or the 
mathematics of variations to be more general) due to the fact that this book is mainly of practical nature 
related to solving variational problems rather than presenting and justifying theories and deliberating 
on abstract issues. However, we provided summaries of the required theoretical background if and when 
needed. 

e Because of the nature and objective of the book (as outlined in the previous remark) we do not go 
during our investigations and problem solving through details and technicalities not related directly and 
strongly to the calculus of variations such as the restrictions on the domain of validity of functions or the 
constraints on the obtained solutions and roots or the methods of solving differential equations. 

e Although the book is supposed to be about the calculus of variations, in many problems we use ordi- 
nary calculus to solve the variational problems and hence the subject of the book is the mathematics of 
variations with the calculus of variations being the main subject. 
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e The terms of 2D shapes (such as “circle”, “rectangle” and “square”) may be used conveniently as surfaces 
or as curves (defined by their perimeters). Similarly, the terms of 3D shapes (such as “sphere”, “cube” and 
“parallelepiped”) may be used conveniently as solids and as surfaces. A matter related to this issue is the 
use of “surface of revolution” and “solid of revolution” (where the latter is usually used when volume is 
involved). 

e Some terms may be used conveniently to represent an object or its quantitative measure. For example, 
“perimeter” may be used to mean the curve or to mean its length and hence we may say “this point is on the 
perimeter” or “the perimeter is equal to 10”. This is to ease the text and avoid unnecessary complications 
and distractions. Accordingly, the context should always be taken into account to determine the ultimate 
meaning. 

e All the functions and functionals in this book are assumed to be sufficiently smooth and differentiable 
(at least piecewise). 

e All the mathematical quantities in this book are real (i.e. not imaginary or complex). Accordingly, all 
the arguments of real-valued functions that are not defined for negative quantities, like square roots and 
logarithmic functions, are assumed non-negative by taking the absolute value, if necessary, without using 
the absolute value symbol. We also exclude any potential mathematical absurdity like division by zero. 
e In solving the problems in this book we deliberately used different methods and approaches when we 
have the freedom to do so. The purpose of this is to diversify and provide the reader with more thorough 
techniques in tackling the variational problems. In fact, in some cases we solved some problems more 
than once using different methods of solution (such as using different variational approaches or different 
coordinate systems). 

e Apart from the basic and fundamental notation and symbolism of the calculus of variations (including 
calculus), we did not commit ourselves to adopting unified notation and symbolism across the entire 
book for secondary objects and concepts. In fact, in some cases we deliberately used different notations in 
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different Problems for these secondary items for the purpose of distinction and to avoid potential confusion. 
We also deliberately diversified some secondary symbols and notations in some cases for the purpose of 
training the reader to deal with different symbols and notations so that he acquires the ability and skill to 
capture and recognize the essential mathematical forms and patterns regardless of the specific symbols and 
notations used to express and formulate these forms and patterns in the individual mathematical problems. 
Anyway, the reader can always consult the Nomenclature for providing consistent (although potentially 
generic, tentative and non-comprehensive) explanation of the used symbols if no such explanation is 
provided locally. 

e The Problems in the chapters and sections represent just a small sample of the categories represented 
by these chapters and sections. We did our best to make the selected sample representative of the entire 
category so that it reflects the main aspects and the most important features of that category (within the 
restrictions on the book size and scope). 

e For optimal categorization of the Problems and to ensure better structures, we may refer in some cases 
to Problems or items (e.g. graphs) in later parts of the book. However, we ensured that there is no 
ambiguity or difficulty in understanding and appreciating the indicated content. 

e In this book we are mainly interested in optimization problems and techniques regardless of the nature 
of the optimum and if it is maximum or minimum (and whether it is local or global).!8! Therefore, we 
generally do not investigate the nature of the optimal technically (e.g. by investigating the signs of the 
derivatives) although we usually provide short remarks based on intuitive and simple (but not rigorous or 
technical) arguments about the nature of the obtained optimal. In fact, we even ignored the investigation 
of the possibility of having non-optimal stationary solutions (e.g. inflection or saddle points) in some 
cases. |4 

e The axes of most plots in this book are not scaled equally. Also, the purpose of the figures in this book 
is to demonstrate and outline the main features and settings of interest in the concerned problems, and 
hence these figures may not be realistic in their shapes and dimensions. 


1.2. The Calculus of Variations and the Variational Principle 


The calculus of variations is a branch of mathematics whose purpose is to find methods and techniques for 
optimizing (i.e. minimizing and maximizing) functionals (and possibly functions as well as special cases). 
The methods and techniques are commonly analytical where the tools and techniques of single-variable 
and multi-variable differential and integral calculus are used extensively. However, these methods and 
techniques have usually numerical implementations and instantiations as well (see for example § 9). This 
branch of mathematics is exceptionally beautiful and motivating. One reason for this is that most problems 
of this subject have physical roots as they originate from real physical issues and challenges. In fact, the 
calculus of variations is exceptionally important to physicists because variational principles and paradigms 
permeate the entire physics and they are at the heart and foundation of many physical problems and issues. 
This has also historical roots because most of the ideas and techniques of this subject were developed 
and cultivated within physical contexts and environments to meet urgent demands for solving real-life 
physical problems. Therefore, physicists and engineers in particular must acquire sufficient knowledge 
and familiarity with this vital and charming subject. 

The entire structure of the calculus of variations (and indeed any variational technique) is based on 
a very simple principle which we call the variational principle. The essence of this principle is that the 
variation of any function at its extremums vanishes.!>! So, to find the extremums (or optimums) of a given 
function where the function reaches its maximum or minimum values (usually locally) we should search 
for the stationary values of the function where its derivative(s) vanish. This principle is intuitive because 


3] This is mainly to reduce the size of the book and provide more space for the main objectives of the mathematics of 
variation (noting that the technical investigation of the nature of optimums is usually lengthy and technically messy). 

4l In this regard, the reader should note that in some variational problems optimal solution may not exist (within the given 
conditions and circumstances). 

5] In this context, “function” is more general than “simple function” and “function of function” where the latter is known as 
functional (see § 1.3). We should also note that although “extremum” suggests maximum or minimum, it is used in some 
texts of variational calculus to include even inflection (or stationary) values. 
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at an extremum the function should change its trend (i.e. from increasing to decreasing or from decreasing 
to increasing) and hence at the extremum the function should cease to vary abruptly. In other words, a 
positive/negative variation trend followed by a negative/positive variation trend should be separated by 
zero variation. 

As we will see, optimization problems generally require the methods and techniques of the calculus 
of variations for their solution. However, some simple optimization problems can be solved by ordinary 
calculus with no need for the calculus of variations.!6! But even these simple problems usually require 
a variational argument (which is based ultimately on the variational principle) for their solution (see for 
example the Problems of § 2.3 and § 5). In fact, some of these simple problems can also be solved using 
the formal techniques of the calculus of variations although this usually incurs an extra cost and hence it 
should be avoided (unless it is needed for legitimate purposes). 


Problems 


1. What is the main objective of the calculus of variations? 
Answer: The main objective is to find and develop mathematical methods and techniques for opti- 
mization of functionals (i.e. finding their maximums and minimums). 

2. In the text we described the calculus of variations as a branch of mathematics whose purpose is to find 
methods and techniques for optimizing functionals. Comment on optimizing. 
Answer: To be more general and inclusive we should replace “optimizing” (or extremizing) with 
“finding stationary values” (or what we may call “stationarizing”). However, optimization is the main 
objective in the investigations of this subject and hence almost all the problems of the calculus of 
variations are related to optimization. In fact, most problems in the calculus of variations are related 
to minimization. 
Note: in this book we commonly use terms like extremizing or optimizing to mean stationarizing 
and we rely on this understanding (noting that extremizing and optimizing are more common in use 
and these cases are more frequent in occurrence in the real life and hence they are the main focus of 
investigation). 


1.3 Functionals in the Calculus of Variations 


In simple terms, a functional is a function of function, e.g. ¢[f(x)] where ¢ is a functional that depends 
on f which is a function of x. So, a functional ¢[f(a)] in an interval [71, x2] is a function that depends on 
all the values of another function f(x) in that interval. A functional in an interval may also be described 
as a map from functions defined on this interval to the real numbers. In the calculus of variations (of one 
variable), the functionals that are commonly dealt with have the following form: 


rul= f ” Bwana) de (1) 


where I is a functional of the function y = y (x), y’ = dy/dz, y (x1) = C, and y (a2) = C2 (with C; and C2 
being given numbers). The notation [y] indicates that the variation and extremization of J is essentially 
determined by the form of y while the notation (x,y, y’) indicates the dependencies of F' (which is the 
integrand). We should also note that the choice of I to symbolize the functional is to indicate the fact 
that the functional is an integral (and hence we usually call J in this book the functional integral). To 
summarize the essence of Eq. 1 we can say: the functional J whose variation/optimization depends on 
the nature of y (which is a function of ) is the integral of F' (which is a function of x, y, y’) over the real 
interval [x1, x2]. 

It should be noted!"! that in the formulations and applications of the Euler-Lagrange equation (which is 
the essence of the calculus of variations and is based on the functional of Eq. 1 as will be investigated next) 
the symbols x,y, y’ are treated as if they are representing variables that are independent of each other 


[6] In fact, some may be solved even by simple arithmetic and algebraic techniques based on simple intuitive arguments. 
\"lTn fact, this note applies to the form of Eq. 1 (i.e. single independent and dependent variables). For other forms (e.g. the 
form of Eq. 14) more details are required (as will be seen in the upcoming chapters, sections, Problems and applications). 
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(although in reality y is usually dependent on x and y’ is usually dependent on x and possibly y) and this 
is reflected in taking partial derivatives with respect to these variables (as will be seen in the upcoming 
examples and applications). It is also important to note that the symbols x, y, y’ should be seen as generic 
symbols and hence they are not necessarily representing variables in Cartesian systems. For example, 
these symbols can represent ¢, p, p’ in 2D polar coordinates or ¢, z, 2’ in 3D cylindrical coordinates (where 
the prime represents d/d@) or t,x,“ in mechanics (where t is time, x is spatial coordinate and « = da/dt). 


Problems 


1. 


Give a brief symbolic definition for “functional” as a mapping. 
Answer: A functional ¢ is a function defined by the mapping ¢: 2 > R where 2 is a set of functions 
and R is the set of real numbers. 


. Make a simple comparison between functions and functionals as mapping devices. 


Answer: Functions map numbers to numbers while functionals map functions to numbers. 

Make a simple comparison between optimization of functions and optimization of functionals consid- 
ering the number of variables involved in the optimization process. 

Answer: The first is an optimization of functions of finite number of variables while the second can 
be seen as an optimization of functions of infinite number of variables. 

Compare the functional notation I[y] to the function notation f(x) and outline the significance of 
each notation. Also, outline the difference between functional variation (in functional relations) and 
function variation (in function relations) at stationary values. 

Answer: The use of square brackets in I[y] is to distinguish functional dependency (i.e. the functional 
I depends on the function y) from function dependency in f(a), i.e. the function f depends on the 
independent variable x. So, in I[y] the value of J depends on the form of y on the entire interval 
[v1,%2] while in f(x) the value of f at a certain point depends on the value of x at that point. 

In functional relations the variation of y anywhere in the interval [11,22] will disturb the stationary 
value of the functional (and hence the stationary value of the functional is obtained from a single form 
of y over the entire interval), while in function relations the variation of « away from the immediate 
neighborhood of the stationary point does not disturb the stationary value of the function (i.e. the 
function and its stationary point have only local dependency). 

Briefly explain the objective in the problems investigated by the calculus of variations and how they 
are dealt with. 

Answer: The objective is to find the form of a function y = f(x) such that a definite integral I of a 
given expression F' that involves y and its derivative y’ = dy/dx is extremum. 

To deal with these problems we start by forming the integral: 


2 
Til= fF yy) de 
x4 
where F' is formulated according to the description and statement of the problem. We then use the 
techniques of the calculus of variations to find the solution y. Any unknown parameters in the solution 
may then be determined by using the given boundary conditions and constraints. The main step in 
the solution technique of the calculus of variations is the formation of the Euler-Lagrange equation 
from the obtained F' where this equation (in its various shapes and forms depending on the nature of 
the problem and the form of F’) is solved subject to the given boundary conditions and constraints to 
obtain the final solution. 

What is the difference between the variational problems in ordinary calculus and the variational prob- 
lems in the calculus of variations. 

Answer: In ordinary calculus the variational problems are about finding points (2;, y;) that extremize 
(or stationarize) a given function y = f(x), while in the calculus of variations the variational problems 
are about finding functions y(a) (represented by curves) that extremize (or stationarize) a given func- 
tional J [y]. So, in ordinary calculus we are looking for points while in the calculus of variations we are 
looking for curves (represented by functions).!§! This is demonstrated schematically in Figure 1. 


[8] In other words, in ordinary calculus (the form of) y is known (e.g. y = «2 +1) but its extremum (or stationary) points are 
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Figure 1: A schematic illustration of the difference in the nature of the variational problems in ordinary 
calculus and in the calculus of variations where on the left (representing ordinary calculus) the objective 
is to find the extremal points (1M and m) on the curve, while on the right (representing the calculus 
of variations) the objective is to find the extremal curve (solid) among all other curves (dashed) in its 
immediate neighborhood that connect two given points (A and B). See Problem 6 of § 1.3. 


1.4 The Euler-Lagrange Equation 


The Euler-Lagrange equation is a relation that implements the variational principle in a specific mathemat- 
ical form when the function that should be optimized is a functional. More specifically, the Euler-Lagrange 
equation is a mathematical relation whose objective is to minimize or maximizel® a certain functional 
I[y] which depends on a function y by finding this y. It is represented mathematically by a differen- 
tial equation whose solution(s) optimize the particular functional J. The Euler-Lagrange equation in its 
generic, simple and most common form is given by: 


where F' (which is a function of z,y,y’) is the integrand of a functional integral I[y] whose varia- 
tion/optimization depends on y (which is a function of x), and y’ is the derivative of y with respect 
to a (ie. y’ = dy/dx). It can be shown (see Problem 3) that the functional given by Eq. 1 (see § 1.3) 
possesses stationary values obtained by solving the above Euler-Lagrange equation (i.e. Eq. 2) where 
“solving” means obtaining the form of y(a). So in brief, the objective of the variational problems in 
the calculus of variations (as formulated by the Euler-Lagrange equation) is to find the function y that 
extremizes (i.e. minimizes or maximizes) the functional J.!"°! 
The Euler-Lagrange equation is simplified in the following cases: 
I. If F does not depend explicitly on x li-e. F=2=F (y,y’)] the Euler-Lagrange equation (Eq. 2) will 
reduce to the following form:!!!! ‘ 
iF 
/ 
By Oy’ ae (3) 


where C' is a constant. This equation is called the Beltrami identity (see Problem 5). 
II. If F does not depend explicitly on y [i-e. Fea F (2, y’)] then oe = 0 and hence Eq. 2 takes the 


unknown and hence we are required to find these points, while in the calculus of variations (the form of) y that extremizes 
(or stationarizes) I is unknown (e.g. whether y = 2? + 1 or y = e® or y = sina, etc.) and hence we are required to find 
(the form of) y that extremizes I (so that a function of any other form or a function that is obtained by a slight variation 
of y in its neighborhood will make J deviate from its extremum value). 
[9] In fact, a more general objective of the equation is to search for stationary points (although they are usually maximums 
or minimums). 
119] To be more general, we may need to replace “extremize” with “stationarize” (i.e. find stationary values). However, we do 
not go through these details. 
111] This equation (and its alike of upcoming equations) is commonly known in the literature of the calculus of variations as 
a first integral of the Euler-Lagrange equation because only one integration is needed to obtain the solution. 
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following form: 
F F 
“ (=) =0 and hence OF =C (4) 
where C is a constant. This equation also applies when F' depends explicitly on y’ only.!"?! 


III. If F does not depend explicitly on y’ li-e. F= F(a, y)| then jar = 0 and hence Eq. 2 reduces to the 
following form: 


OF _ 
Oy 
This equation also applies when F depends explicitly on y only.!!*! 


0 (5) 


It is important to note that using these simplified forms is not mandatory although they usually (but not 
necessarily) make the solution easier. Accordingly, in the future we will use these simplified forms if they 
are convenient. We should also note that the full form and the simplified forms usually lead to different 
forms of equations even after simplification although in principle this should not affect the final solution. 
We should finally draw the attention to the following remarks: 

1. The Euler-Lagrange equation may also be given in the following forms: 


OF d OF 

Ox dx (F = =) 24 (6) 
OF OF por OF 
Oy  Oxdy! a OyOy' ¥ Oy’? 


= 0 (7) 


These forms will be investigated in Problems 6 and 7. 

2. The Euler-Lagrange equation is a necessary, but not sufficient, condition for extremizing the functional 

I and hence further investigation to determine the nature of the solution is generally required. However, 
in most practical variational problems (especially in physics) the required extremizing solution is usually 
found by just solving the Euler-Lagrange equation with no need for further investigation. In fact, in 
many cases the nature of the solution (as being minimizing or maximizing or even being inflection) 
can be easily inferred from the nature of the problem using non-formal arguments and simple intuitive 
considerations (such as geometric or physical considerations). 
Regarding the practical side of this issue, formal identification (by using technically rigorous tools and 
methods) of the nature of the obtained solution is generally difficult and requires obtaining and testing 
derivatives of various orders. Therefore, in this book we generally avoid going through these messy 
details contenting ourselves with just obtaining the solution (and possibly relying on the context and 
other intuitive considerations to identify the nature of the solution). 

3. A solution of the Euler-Lagrange equation may be called an extremal (or extremal function or extremal 
curve) which is inline with the nature of most solutions of the variational problems in the calculus of 
variations (although the previous remark should be taken into consideration). 

4, If the integrand F is a total derivative (with respect to x) of a given function of x and y, say ¢(2, y), 
then the Euler-Lagrange equation is satisfied identically, i.e. any function (of any form) that satisfies 
the given boundary conditions will satisfy the Euler-Lagrange equation (see part m of Problem 9). 
The reason is that in this case the value of the functional integral is solely determined by the value 
of ¢ at the boundaries (and hence any function ¢ that satisfies the given boundary conditions should 
“optimize” the functional integral regardless of the form of ¢), that is: 


©2 ©2 co 
Tul= f Feyy) de= f° Pav = (62 = 4, - 4, 


1 


1121 Tm fact, in this case a is independent of x and y and this should lead to a simpler equation (i.e. y’ = constant) that 


can be used instead of Eq. 4. 
113] Ty fact, in this case ge is independent of x and y’ and this should lead to a simpler equation (i.e. y = constant) that 


can be used instead of Eq. 5. 
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An implication of this is that adding a term (or terms) that is a total derivative of some function to the 
integrand F' will not change the Euler-Lagrange equation and this could lead to simplification in some 
cases where the additional term(s) can be ignored in the derivation of the Euler-Lagrange equation of 
the given variational problem (see for example part h of Problem 10 or part j of Problem 11). 

5. As indicated earlier (see § 1.3), the symbols x,y,y’ in the Euler-Lagrange equation are treated as 
if they are representing variables that are independent of each other. This means that the partial 
derivatives in the Euler-Lagrange equation operate on the explicit (but not implicit) dependencies of 
these variables on each other. For example, Oy/0x = 0 according to this type of partial differentiation 
even though y is a function of x. However, there are some exceptions to this rule where in some cases 
partial derivatives operate on both the explicit and implicit dependencies (see § 1.6). In brief, when we 
deal with the Euler-Lagrange equation (in its different forms and flavors) we have two types of partial 
differentiation where in one type (which is the common type) only explicit dependencies are considered 
while in another type (which is the exceptional type) both the explicit and implicit dependencies are 
considered. So, we should be careful about the interpretation and treatment of partial derivatives in 
the applications of variational calculus. 

6. As indicated earlier (see § 1.3), the symbols z,y,y’ in the Euler-Lagrange equation should be seen as 
generic symbols and hence they are not necessarily representing variables in Cartesian systems. 

7. The term “Euler-Lagrange equation” may be used to refer to the Euler-Lagrange equation in its general 
form (i.e. as given by Eq. 2 and its equivalent and reduced forms as well as its upcoming generalized and 
extended forms) and may be used to refer to the equation obtained from applying the Euler-Lagrange 
equation in its general form to a particular problem and hence the Euler-Lagrange equation in this 
case is an instantiation of the Euler-Lagrange equation in its general form (i.e. it is the Euler-Lagrange 
equation for that particular problem). The meaning should be obvious from the context. 


Problems 


1. Describe in plain terms a simple typical variational problem and how it is formulated and solved. 
Answer: Suppose we have a functional J which usually depends on x and y(a) as well as the derivative 
y’ = dy/dz, that is: 
T= $¢(2,y,y') (8) 


This functional is in the form of an integral and its optimization (or stationarization to be more 
general) depends on the form of the function y assuming that y is fixed at its two end points A(x, y1) 
and B(x2, y2), ie. y(a1) = yr = C, and y(x2) = yo = Ca. Moreover, we are interested in optimizing 
this functional by finding the form of y that achieves this optimization. Accordingly, we modify the 
notation of Eq. 8 (to make it more explicit and suggestive) to the following: 


Tul= [Foye de (9) 


1 


where the notation I[y] is to suggest that the required optimization of J depends on the form of y 
(which we are looking for) and where F is the integrand (which depends on z,y,y’). In brief, Eq. 9 
suggests that if we use a certain function y (which we are looking for) in the integrand F then the 
integral I will be optimal. So, our objective now is to find this special (or optimizing) function y(z). 
To find this optimizing y we form the following Euler-Lagrange equation: 


OF d (OF\ _ 0 

Oy da & 7 
using F’ that we formulated in Eq. 9. We then solve this equation (which in general is a second 
order differential equation) to find the optimizing solution y. Finally, the solution of a second order 
differential equation contains two unknown constants (as a result of two integrations) and hence we 
need the given boundary conditions (i.e. the fixed values of y at the two end points) to find the specific 


solution to our variational problem. 
Note: as indicated in the question, the above description belongs to a simple typical variational 
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problem and hence it can be subject to extensions, generalizations and restrictions in different and 
more complex situations (as will be investigated later on). 

2. State some general properties of the Euler-Lagrange equation. 
Answer: For example: 
e This equation in its various shapes and forms (as investigated earlier and will be investigated further 
later on) is the main pillar of the mathematics of variation (and specifically the calculus of variations). 
e The existence and uniqueness of solution of this equation is not guaranteed in general and hence 
there may not be a solution, or there is only one solution, or there are multiple (finite or infinite) 
solutions. 
e The Euler-Lagrange equation is a necessary, but not sufficient, condition for the existence of optimal 
solution(s) even if solution(s) to this equation do exist. heccrdinely: any obtained solution to this 
equation should be inspected and assessed to determine its nature and if it satisfies the requirements 
and meets the objectives (e.g. searching for a minimum). In this regard, mathematical and non- 
mathematical (e.g. physical) considerations should be taken into account. 

3. Derive the Euler-Lagrange equation using the variational principle. 
Answer: In the following we outline a rather simple derivation method of the Euler-Lagrange equation. 
We start from the functional of Eq. 1, that is: 


b 
Ty] a F(o,y,y') de 


where the values of y at the two end points are fixed (noting that 21 = a and x2 = b are given 
constants). Now, if y is perturbed slightly to y + dy (where dy is a tiny change or variation in y) then 
I should also be perturbed to J + 5I where 6 is given by:!"41 


6 = Ily+ody]—Tly] 


b 
Samal eta 


I 


a OF is 

= [ \Fews) ys Oy ike 
*LOF OF ._, 

= [| Fp8u gard | ae 


b b 
OF OF ., 
[ 5 dy dx 4 Bylo! dx 
‘OF OF. )|° f° d (oF 
= dx 4 d 
i, By oY r ae ‘ dn (5 -) by x 


OF TOF d (OF 
a ) 6 dy| d 
E i i Ee 1 de & | ° 
where in line 3 we use first order Taylor expansion, and in line 7 we integrate by parts the second term 
of line 6. Now, since the two end points are fixed (and hence dy = 0) the first term in the last line is 


zero and hence: 
TOF d (OF ‘lor d (oF 
arm | bw da (=) z] am f ea ate )| 1 


[14] The use of 5 here to symbolize variation is common (and rather conventional although not compulsory) and should 
indicate the difference between functional variation and function variation where in the latter d or 0 are usually used to 
symbolize variation (see Problem 4 of § 1.3). 
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By the variational principle, J should be stationary at its extremum and hence 6J = 0 for all possible 
tiny variations dy in y (ie. dy is arbitrary). This is true iff 


OE AOR Vag 
Oy dx \ dy’) — 


which is the Euler-Lagrange equation. It should be understood that the last equation applies over the 
entire interval (except possibly at a few isolated points). 

4, Referring to the simplified forms of the Euler-Lagrange equation (as given by Eqs. 3-5), it is common 
in the literature to say F' in these cases is independent of (or does not depend on) z or y or y’. Clarify 
this issue. 

Answer: This saying means that these variables do not appear explicitly in the expression of F 
although F is generally dependent on these variables implicitly. 

5. Prove the Beltrami identity (Eq. 3). 

Answer: By the chain rule we have: 


dF OF de | OF dy _ OF dy! 
dx dx dx Oy dx Oy! dz 
5 OR> OF 4 HOF ay 
Ox By 7 © Oy! dx 
_ OF ,. OF dy’ 
~ Oy” yl de 


where the last line is justified by the fact that OF’ /Oxz = 0 because F is supposed to be independent of 
x (i.e. F does not depend explicitly on x). Now, by the Euler-Lagrange equation (i.e. Eq. 2) we have 


on = a (25) and hence on substituting from this into the last line we get: 
y aw \ dy 


a 7 la (ar)|” + apa 


dx dx \Oy! dy! da 
dF d (OF 
oi (5 y’ ) (product rule) 
GT PON, a 
dx dx ay!” 
d OF , 
is (F- 3) = ° 
OF 
F- Y og = C 


Note: the Beltrami identity (Eq. 3) can be obtained more simply from the other form of the Euler- 
Lagrange equation (i.e. Eq. 6) by setting OF /Oz = 0. However, we followed the above method for 
more practice. 

6. Derive Eq. 6 from Eq. 2. 
Answer: We have: 


dF — OF dx | OF dy x OF dy OF OF a OF ,, (hai 
dx Oxdx Oydx Ody dx Ax By ” Oy!” r 

d { ,OF dy OF  ,d (OF ,OF ,d (OF 

dx (u Oy! ) dx Oy! Vz Oy! - Oy! Y ie Oy’ Produce tale) 


Now, if we subtract the first line from the second line we get: 


d ( ,OF dF ,d (OF OF OF , 
dx \¥ Oy! de” dx Oy! Ox By ” 
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OF | d (_,0F OF. 4G fOF\ OF; 
Ox | dx \% Oy! de dz Oy’ Oy 
OF d F jOP\ _ /|OF d (OF 
Ox «dx . Oy’ Oy dx \ dy’ 
OF d OF 
F ; = 
Ox dx ( - Oy’ ) : 


where in the last line we used Eq. 2. 
Note: from the above derivation we can see that Eq. 6 is based on Eq. 2 (since we used Eq. 2 in 
the derivation of Eq. 6). However, strictly speaking Eq. 6 is not another form of the Euler-Lagrange 
equation (as we described it in the text) although it should be equivalent to it. 

7. Show that the Euler-Lagrange equation may be given by the following form (see Eq. 7): 


Fy — Fy —yFyy —y" Fyy = 0 


where the subscripts mean partial derivatives with respect to the variables represented by these sub- 


scripts, e.g. Fy, means a (35 : 


Answer: We have: 


OF d (OF 
By ae (5) = 0 (Eq. 2) 
OF O (OF \ dx O (OF \ dy 0 (OF\ dy’ ; 
Oy Ox (=) dx Oy (=) dx Oy! (=) ae 0 (chain rule) 
OF OF |, OF pO ‘ 
dy  Odxdy’ 7 yoy’ % dy? ~ 
Fy Pye y Fy'y Uf Piyigé — 0 


8. Use F(a, y,y') = x«2y + y” to verify that Eqs. 2, 6 and 7 are equivalent. 
Answer: From Eq. 2 we have: 


O73 12 d 0 3 12 
= 0 
py ute] — ae ay ete] 
d 
3_ _(2y') = 0 
a” — = (2y’) 
x? — 2y" = Q 
From Eq. 6 we have: 
0 d 0 
os [xy y] i (Cara ae [ey +9") SG 
d 
3a7y (ary y? 2y'") _s 0 
d 
3a7y = a (a®y = y'”) = 0 
3a7y — 327y — xy’ 4 2y'y” = 0 
—ay 1 yy" = 0 
a —2y” = 0 (y’ #0) 


From Eq. 7 we have: 


fa) 2 oO? 2 
ay ete") — agg ute" —a' gay yt" "a ey ty] = 0 
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0 
D1’ Dy’ i Dr’ ae 0 
a — 5 byl—y a By] Yay Pu 
zg? —0-0-2y"” = 
2? Qy" = 


So, the three equations are equivalent (at least in this case). 
Note: the three equations will lead to the same result even if y’ = 0 because in this case they will all 
lead to x = 0 (at least as a possibility). 

9. Obtain the Euler-Lagrange equations for the following variational integrands: 


(a) F(z, y,y') = ay*.09l 

(b) F(x, y,y') = ay”. 

(c) F(x, y,y') = yy”. 

(d) F(z,y,y') = ryy’. 

(e) PUG aa) = ety?) /y'. 
(f) F(t,2, 4) = 2/1 + 4.l6 
(g) F(t, 6,0) =tsin@ 

(h) F(a,y,y') = y?e¥ 

(i) F(z,y,y') = 1 ay 

@) Fla,yy) =y? -y?. 

(k) F(a,y,y') oad y? el igo 
() F(z,y,y') = y?/Jy2 + y?. 
(m) F(a,y,y') = ay! + ay. 


(a) y’ does not appear in F' and hence we can use Eq. 5, that is: 


OF 
a 0 
014 
dy (zy’) = 0 
4cy> = 0 
sy? = 0 


(b) y does not appear in F' and hence we can use Eq. 4, that is: 


OF 


Oy! aie 
0 
Dy! (ay?) = "C 
3ay"7 = OC 
oy =D (D = C/3) 


(c) x does not appear in F' and hence we can use Eq. 3, that is: 


OF 
F-y— = 
Oy C 
ea (yy) =. ey 
Oy! 


[15] Despite the absence of explicit dependency on y’ in this expression, we use F(z, y, y’) to highlight the overall dependencies 
(whether explicit or implicit). This is also followed in similar examples and similar missing dependencies (unless stated 
otherwise to highlight the explicit dependencies only). 

[16] We note that the overdot (in this expression and similar expressions) means derivative with respect to t (i.e. d/dt). 
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yy” — y' (2yy') C 
yy? —2yy" = C 
yye e. D 


(d) We use Eq. 2, that is: 


7) j 7) j 
Oy [cyy'] Tn ( Dy [cyy’] 


(e) We use Eq. 2, that is: 


(f) t does not appear in F and hence we can use Eq. 3 (noting that t,x, replaces x, y, y’), that is: 


OF d (dF\ _ 
Oy dx \ Oy’ a 
0 [ety d 0 [ety? _ 
Oy | y! dx \ Oy’ | y' a 
3e7y? d ery? - 
y! dx y’2 i 
Bey? ery? 3e7y7y! Ler yy" _ 
y! y? y2 y's = 
3e7y? ery? Be" y? Ler yy!" = 
y! ' y”? y! y? = 
Ger y? ery? Ler yy! _ 
y! yl? y's ~ 
6y? y? Qy xy" 

y! y!2 y3 a 

OF 

F-¢2 = 
” Oa 


aVl+é #2 (evT Fa) = 


0 


C 


C 


C 


D 


(product and chain rules) 


(e* #0) 


17 


(g) 6 does not appear in F and hence we can use Eq. 5 (noting that t, 6,6 replaces x, y, y’), that is: 


OF 
00 


1.4 The Euler-Lagrange Equation 


0 2 = 
36 (tsind) = 0 
tcos@ = 0 


(h) Only y appears in F' and hence we can use Eq. 5, that is: 


OF 
— = 0 
Oy 
0 
5 ret) = 0 
Qye4+ yer = 0 (product rule) 
y+’ = 0 (e" £0) 
Hence, y = 0 or y = —2 (which is consistent with footnote [13] since y = constant). 
(i) Only y’ appears in F' and hence we can use Eq. 4, that is: 
OF 
2 ee 
Oy! 
0 
eae re 2) = 
Oy’ ( ty 
9 / 
¥y _ C 
2/1+y? 
/ 
eG 
1 } y/2 
y? = C2 a Ory? 


i ee Be 
VI C2 


which is consistent with footnote [12] since y’ = constant. 
(j) x does not appear in F and hence we can use Eq. 3, that is: 


OF 
POY pai = °C 
12 2 KO 12 2 
Grae) voy (y y) = ¢ 
y?—-y-y' (4) = C 
yy? = C 
y?+y = D (D=-C) 


(k) x does not appear in F' and hence we can use Eq. 3, that is: 


OF 
Fovaa = GC 
2 2 
y° —1 0 f(y -il 
( i ) Vou ( yl? ) rt 
2 2 
od Cee 
yf’? if 2 =) melee 
Bian ae (4S =3 4! 
y!? yl? 
2 
yo 1 
= D (D = C/3) 
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Dy? —y*?+1 = 0 


(1) x does not appear in F’ and hence we can use Eq. 3, that is: 


y? ,/ 0 y”? 
/y2 + yl? Oy! ye + yl? 


yo y/ 2y! y'? (2y') 
Jy + y? VP ty? 2y2+y2)°? 


y? Qy"? y* 
Vy? + yf? Vy? + y? (y2 + yl2)3/? 
y? y4 
ye +y? (y2 + yl2)3/? 
y? (y? 4 uy) y"* 
(y2 + yl2)3/? _ (y2 + yl2)3/? 
y2y”? 
(y2 + 2)? 
yty!4 
(m) We use Eq. 2, that is: 
OF d (OF 
Oy dx \ dy 
0 d 0 
bevsini-2 (tev 2m 
di). 
22 — ae (x ) 
2a — 2x 


= 0 
= 0 
= 0 
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This trivial (but true) result is justified by remark 4 in the text because F' is a total derivative of 
x”y and hence the Euler-Lagrange equation is satisfied identically. In other words, we do not have a 
specific function that can be seen as a solution to the Euler-Lagrange equation since any function that 
satisfies the boundary conditions will satisfy the Euler-Lagrange equation. Hence, the Euler-Lagrange 


equation is 0 = 0 and its solution is y = any function (satisfying the boundary conditions). 


10. Obtain the Euler-Lagrange equations for the following functional integrals: 


(a) I[y] = bee (y'? — 4,/xy) dx. 
b) I [x] = fi? (t2? + 4?) dt. 
¢| = iss (tcos @ + t2¢) dt. 
[y] = . (y’? - zy’) dz. 
x)= fe (Pa — =) dt. 

( 


(where a is a constant). 


1 
= fi? (ta? — 20 — ae) dt. 


1 
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G) Tiyl= Se? (ey + yy! +9? + 2y?y’) de. 
(k) Tly] = Sey V1 - y? de. 

(1) Tly] = fo? Bo ae. 

Answer: 

(a) Comparing this functional to the functional of Eq. 1, we see that F (a,y,y’) = y’? — 4,/zy and 
hence the Euler-Lagrange equation (i.e. Eq. 2) is: 


0 d {0 

py ly? — 4ve0] ie ( ay [y? yi] ) = 0 

Ax. d He oe 
Defy de = 0 

2x yo 
Ta 2y° = O 

(i = 

Vry 


(b) Comparing this functional to the functional of Eq. 1 (noting that x,y, y’ correspond to t,x, «), we 
see that F (t,2, 4) = ta? + 4? and hence the Euler-Lagrange equation (i.e. Eq. 2) is: 


0 ae ee d 0 Dav 32 = 
gy iat +24] — 5 (3p le +84]) = 0 
Cie. 
2tx — =, (24) = 0 
2tx-—- 2% = 0 
z—txr = 0 


(c) Comparing this functional to the functional of Eq. 1 (noting that x,y, y’ correspond to t, ¢, d), we 
see that F (¢, d, 6) =tcos¢ + t2¢ and hence the Euler-Lagrange equation (i.e. Eq. 2) is: 


- [icos | 4 “ (= [tcos +124) ) = 0 


d 
-itsng——(t?) = 0 
sin @ ae) 
tsing+2t = 0 


(d) We have F (x,y, y’) = y’? — ry” and since F is independent of y we can use Eq. 4, that is: 


6) 
Dy (y’ eos ay’) —- C 
2y' —2ry’ = C 
y(l-s) = D (D =C/2) 


(e) We have F(t, x, «) = t?4—.x? and hence the Euler-Lagrange equation (i.e. Eq. 2 noting that 2, y, y’ 
correspond to t,x, 2) is: 


0 d {0 
2 (Pe- aa - 2 (+ (Pe a Si 
diay _ 
22-4 (8) = 0 
—224-—2t = 0 


8 
+ 
Hw 
| 
j=) 
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(f) We have F (x,y, y’) = y” — ye*” + y? and hence the Euler-Lagrange equation (i.e. Eq. 2) is: 


0 12 2 d 0 12 2 
ax ax = 0 
py lu ue +a] — a | ay WW — vet +97] 
d 
—e% + 2y— — (2y') = 0 
dx 
et Ey Syl! = 4p 
i yp => 


(g) We have F (x,y, y’) = y’? and hence we can use Eq. 4, that is: 


Oy’? 

Oy! ne 

3/2 = C 

y2 =D  (D=C/3) 


which is consistent with footnote [12] since y’ = constant. 
(h) We have F(a, y,y') = y+ y'y — y’ and since F is independent of x we can use Eq. 3, that is: 


( 12 yy im) V5 (y? +y'y—-y") = © 
yr tyy-y—-y Qy ty) = C 
y? yy y" 2Qy/? yy =, G 
-y” = y? = C 
y?+y? = D (D=-C) 


d(y?/2) 


Note: referring to remark 4 in the text, we note that y/y = ;— and hence it is a total derivative of 
y’/2. We could therefore use the suggested simplification in that remark and obtain the Euler-Lagrange 
equation using F' = y/? — y?, that is: 


0 


Cat as eae Dy (y?-y’) = C 
y?—y’—y (2y') = C 
y2—y?—-%¢? = C 

y?+y = D 


(i) We have F(t, x,#) = ta? — 2x — x& and hence the Euler-Lagrange equation (i.e. Eq. 2 noting that 
x,y,y’ correspond to t,x, @) is: 


7) d {oO 

au [tat? 22 va] di & [tat 2x val) = 0 
d 
2-2 F (2t¢-2) = 0 
2-4¢-(24+4+2t%-4) = 0 
—2-—“4-2¢-22zt+2 = 0 
—2-24-2t¢ = 0 
t¢+e+1 = 0 
2 
Note: referring to remark 4 in the text, we note that rz = a) and hence it is a total derivative of 


x /2. We could therefore use the suggested simplification in that remark and obtain the Euler-Lagrange 
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equation using F = tz? — 22, that is: 


(j) We have F(z, y,y’) = xy + yy’ + y? + 2yy’ and hence the Euler 


d (0... 
22] -P (+ [ta 21] 
d ; 
a ae (2ta) 
—2 — (Qu + 2tz) 
—2— 24 — 2tz 


ti+a+ 1 


d 


ue [ay + yy! 
Oy 


Note: referring to remark 4 in the text, we note that yy’ is a total derivative 


y" 2y*y'] 


z+y! +2y + 4yy’ — y' — dy’ 
x + 2y 


d 


x + 2y 
y 


I 


I 


a a ) 


0 


d 
x+y! + 2y + dyy! — — (y + 2y*) 


0 


x 
2 
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-Lagrange equation (i.e. Eq. 2) is: 


of y?/2 and 2y7y’ is a 
total derivative of ie We could therefore use the suggested simplification in that remark and obtain 
the Euler-Lagrange equation using F = xy + y?, that is: 


a ne ar : 
5 bu tv]-2(y [ev +17) 


(k) We have F(z, y,y’) = /1— y? and hence the Euler-Lagrange equation (i.e. Eq. 4 because y does 


not appear in F’) is: 


0 
eal = C 
—y' _ 
raga 
C2 
12 — 
a or. 


which is consistent with footnote [12] since y’ = constant. 


12 
(1) We have F(z,y,y') = fay 
12 ra) 12 

yy y YY =e 

1+ yy’ Oy \ Aye! 
yy’ FT 2yy’ yy? (y) en 

1+ yy’ I+yy (1+ yy’)? 
yy’? ayy? vy lg 

ltyy ltyy (1+yy') 


and since F is independent of x we can use Eq. 3, that is: 
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yy’? yy!’ 

I+yy (1+yy')? 
yy*(1+yy') .  y’y”® 

(l+yy’)? (1+ yy’)? 
yy 

(1+ yy’)? 

yy’? 
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C 


C 


D(1+yy')” (D = -C) 


11. Find the extremizing (or stationarizing) functions of the following functional integrals and verify the 


results: 

(a) I[y] = Ne a y!? da: 

(b) Ix] = f? (a? +4?) dt. 

(c) I[y] = oe rv/1— y’? dx. 

(d) I ly] = fr? (y? — ey) de. 

(e) I[r] = he (Sr? —cos@) dé 

(f) I fy] = ie (y’? —aaxy’) dx 

(g) Ty] = fr? (y"? — 2?) de. 
(h) I [y] = f2? (y? — ky) de 

(i) I ly] =f? (y? — 2y + 5a) de. 
(3) ly] = Se? (y? + 2yy! — y?) de. 
(k) I [yl = fr (y? + 2yy’ + 4y?) da. 
(1) I[y|= a Vet y) de. 


Answer: 


(where the prime means d/d@). 


(where a is a constant). 


(where & is a constant). 


(a) Comparing this functional to the functional of Eq. 1, we see that F = x?y’?. Now, since F is 


independent of y we can use Eq. 4, that is 


O 
By? (a?! 


2x?y 


*) 


/ 


C 
C 


This is the Euler-Lagrange equation!!”] which we solve as follows: 


dy = = = C 
dx. 23 
_ C 
y= pa 


which is the extremizing function. 


+ D 


(D is a consatnt) 


To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


a: 
C 


C 
C 


(171 When we say “This is the Euler-Lagrange equation” here and in similar contexts, we mean “for the specific problem”. So 
in fact, it is an instantiation of the Euler-Lagrange equation in its general form (as given by Eq. 2 and its variant forms). 
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(b) Comparing this functional to the functional of Eq. 1 (noting that x,y, y’ correspond to t, x, a), we 
see that F = x? + «?. Now, since F is independent of t we can use Eq. 3, that is: 


(2? +47) —& a (a? +a°) = 


C 
Ot 
o?+é%-—#¢(2¢) = C 
a+e*—-2¢ = C 
gg = OC 
ze = 2’-C 
This is the Euler-Lagrange equation which we solve as follows: 
Lr = pa 2 = C 
dx 
pclae eee ee ep ie 
dt sre 
d 
z = ctdt 
x2—C 
In ( g2—-C+ x) = it+D (integrating both sides) 
a?—C+a = ett? 
x2—-C+az = Ee* (E =e?) 


So, the extremizing function x(t) is given implicitly by the last equation. 
To verify the result we show that the last equation is equivalent to the Euler-Lagrange equation, that 
is: 


pace ee Se 


In (v x2—-C+ x) = InE+t (taking natural logarithm of both sides) 


(see +1) dx 
Va? —C+a dt 


z+Vx2—C 
Vee dx 


Vat —C+a dt 


t1 (taking derivative of both sides with respect to t) 


lI 

LU 
T 

— 


8 
II 
Ht 
_ 


on a oA 


x xv —C 


which is the Euler-Lagrange equation. 
(c) We have F(z,y,y’) = 2\/1— y”, and since it is independent of y we can use Eq. 4, that is: 


5a (evi-v?) = © 
xy’ C 


Ji-y2 


This is the Euler-Lagrange equation which we solve as follows: 


xy’? = C? _ C7y/2 
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C2 
2 
yO +e 
! +1 
0 
(a/C)° +1 
: a 
y = +Carcsinh (=) + D 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


— 


+Carcsinh (4) + DI 2 


x 
C 
v 1 — [4Caresinh (S) + D]” 


1 
if lactien| 2 


[aah] 


8 
iw) 
o 
~ 
3 
+ 
BR 
Ils 


C2 


= CC? 


C2 


C2 


OS e 


(d) We have F(z, y,y’) = y’ — xy and hence the Euler-Lagrange equation (Eq. 2) is: 


0 12 d 0 12 reat 
ay tel oe (a [y"—zy]) = 0 
d 
—7 — — (27') = 
x — ~~ (2y') 0 
-x-2y" = 0 
Qy" +a” = 0 


This is the Euler-Lagrange equation which we solve as follows: 


1 
a 52 
yo = es, 
4 
1 
y = —Fgt +Cz+D 


which is the extremizing function. 
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To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


1 1 
2(-G2" +24) +2 = 0 


1 : 2 
2(-727+C40) z= 0 


mere 2-10 
mae +r = 


—“+xz = 0 


(e) We have F'(6,7r,r’) = ar? —cos@ and since F is independent of r we use Eq. 4 (noting that 2, y, y’ 
correspond to 6,r,7’ and the prime means d/d6), that is: 


2] 
Aas 
Nl eR 
3 

v 
| 

lo) 

oO 

DN 

Dp 
eee 
lI 


This is the Euler-Lagrange equation which we solve as follows: 


dr 
FT C 
r = C@0+D 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 
(Co+D) = Cc 
(C +0) C 
C= C 


Il~ 


(f) We have F(z, y,y') = y’? — ary’ and since F is independent of y we can use Eq. 4, that is: 


0 
dy! (y’ - axy') = C 
2y’-ar = C 


This is the Euler-Lagrange equation which we solve as follows: 


ea: Ate 
c SND 
C 
y = get aetD 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


2 Be it Oda Lae = C 
4 2 a 


a C ? 
2(Se+S +0) — ax = «¢ 


ax+C-ar = C 
C= €C 
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(g) We have F(x, y,y') = y'!/? — x'/? and since F is independent of y we can use Eq. 4, that is: 


io (yt? = 24?) rs: 
wu? = GC 


This is the Euler-Lagrange equation which we solve as follows: 


yf A = 2 
y = 4C?2 
y = D (D = 1/[4C")) 
y = D«+E 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


2? 


1 
(Dey ey 


: C 
1 1/2 2 

5 (D +0) £6 

oe = C 

520 = C 

C=C 


(h) We have F(z, y,y’) = y” — ky and since F is independent of « we can use Eq. 3, that is: 


0 
12 k / 12 k = 
(uy —ky) —y Dy (y° — ky) C 
y”? —ky—y' (2y') = C 
—ky-y? = C 
ky+y? =D (D=-0) 


This is the Euler-Lagrange equation which we solve as follows: 


5 hy 


aE = dz 
2 
oe D-ky = «+8 
1 k2 ; 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


( jo-(e+2)']) +(F [>- Kier my]) ae 


p-* @+By ( 0 ia (a 4 pl) 


Il~ 


D 
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k? k ‘ 
D- (+B +(-F (+8) = D 
ke re , 

D 7 @+£y +7 @+ 28) = D 
D = D 


(i) We have F(x, y,y’) = y’ — 2y + 5a and hence the Euler-Lagrange equation (i.e. Eq. 2) is: 


O d 0 
Dy [y’* — 2y + 52] i (= [y? 2y +51] ) = 0 
d 
-2- = (2) = 0 
y’+1 = 0 


This is the Euler-Lagrange equation which we solve as follows: 


y" _ —]1 
y = -#+C 
2 
y= —5+Cr+D 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


~ 


(-#+C+0) +1 = 0 
(-1+0)+1 0 
-1+1 = 0 


I~ 


(j) We have F(a, y,y') = y’? + 2yy! — y? and hence the Euler-Lagrange equation (i.e. Eq. 2) is:!'8] 


0 12 / 2 d 0 12 / 2 
a 2 +2 = 0 
ay | + 2yy' — y?] o ayt lv yy’ —y"| 
d 
2y’ — 2y — — (2y/+2y) = 0 
dx 
Qy’ — 2y—2y"—2y' = 0 
y ty = 


This is the Euler-Lagrange equation whose solution is: 
y =asinz + bcosx (a and b are constants) 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


Mu 


(asing + bcosx) + (asina + bcos x) 0 


0 


Ils 


Il~ 


(acosaz — bsinx)’ + (asinx + bcos 2) 


[18] Referring to remark 4 in the text and noting that 2yy/ is a total derivative of y2 we can obtain the Euler-Lagrange 


equation using F = y’? — y?. 
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Il~ 


(—asin x — bcos) + (asin x + bcos a) 
0 => 


(k) We have F(z, y,y’) = y’? + 2yy’ + 4y? and hence the Euler-Lagrange equation (i.e. Eq. 2) is:!1% 
OSs a5 d ( 0 
oN >) / 4 2 12 ) / 4 2 = Q 
jy Lu” + uu! + 49") a(au yy’ + Ay?| 
d 
Qy! — — (2y' +2 a 
y + 8y — = (2y' + 2y) 0 
Qy’ + 8y — 2y"—2y' = 0 
y"—4y = 0 
This is the Euler-Lagrange equation whose solution is: 
y = asinh(2x) + bcosh(2z) (a and b are constants) 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is: 


[asinh(2x) + bcosh(2z)| fies 4/asinh(2x) + bcosh(2z)| 2 
[2a cosh(2x) + 2b sinh(2a)]' — 4[asinh(2z) + bcosh(2z)| 7 


[4a sinh(2x) + 4b cosh(2x)] — 4/asinh(2x) + beosh(2z)] = 
0 = 


(1) We have F(a, y,y’) = «(1+ y”) and hence the Euler-Lagrange equation (i.e. Eq. 4 since F’ has 
no explicit dependency on y) is: 


~ 
OS oO oO 


dVa(1+y") 
— oy 
yfe 
/L+y/2 et 
yx 7 C2 (1+y7) 
C2 
12 = 
gO SS eae 
2 
i Se ae (10) 


This is the Euler-Lagrange equation whose solution is: 


y = +2\/C? (a4 —C?)+D 


which is the extremizing function. 
To verify the result we substitute from the last equation into the Euler-Lagrange equation, that is:!?° 


Ig C2 

[+2 C? (@—C?) + D| => =x Ree 
QVC? ? C2 
“peo +? ~ *V¥2-@ 

C? C? 

“Va-@ ~ “Ve-@ 


[19] Referring to remark 4 in the text and noting that 2yy’ is a total derivative of y? we can obtain the Euler-Lagrange 
equation using F = y’? + 4y?. 
[20] Th fact, the result can be verified more easily by just differentiating y with respect to x to obtain Eq. 10. 
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12. Find the extremizing (or stationarizing) functions of the following functional integrals as well as the 
specific solutions for the given boundary conditions: 


(a) I[y] = f? ge + ky?) da with y («1 = 0) = 0 and y (a2 = 1) = 1 (k > 0 is a constant). 
(b) I [y] = e (cy — y'?) dx with y (%1 = 0) = 0 and y (a2 = 1) = 12. 

(c) I ly a ae with y (a1 = 2) = 1 and y (a2 = 4) = 31. 

(d) I[y] = ee ee dx with y (x1 = 0) = V2 and y (x2 = 1) = 1. 

(e) I[y] = ee (y’? — y? + ycosha) dx with y (a1 = 0) =0 and y (a2 = 7/2) = 1. 

(f) T[y] = fo, (v2 +y? —4ycosx) de — with y (a, = 0) =1 and y(#2=7) = 1. 

(g) I ly] = A (y’? — y? — 2axy) dex with y (#1 = 0) =1 and y(r2 = 1) =2. 

(h) I [y] = i (y? — ay) dx with y (a1 = 0) = 0 and y (x2 = 1) =3. 

Answer: 


(a) Comparing this functional to the functional of Eq. 1, we see that F = y’? + ky? and hence from 
the Euler-Lagrange equation (Eq. 2) we have: 


0 12 2 d 0 12 2 
+k +k = 0 
ay lu + hu] — ae ay ly? + By] 
d 
2ky — = (2y') = 0 
Qky —2y"” = 
y—ky = 


So, the solution is y = Ccosh(Vkx) + Dsinh(Vka) which can be checked by substitution into the 
Euler-Lagrange equation, that is: 


[Ccosh (Vk) + Dsinh (Vkx)]" — & [C cosh (Vk) + Dsinh (Vix) | 
[VEC sinh (Vir) + VED cosh (Vi =\| - k [C cosh (Vk) + Dsinh a 2 9 


[ac cosh (Vix) + kDsinh (Vkx) | —k le cosh (Vie) + Dsinh (Vie) 


Now, from the condition y (#; = 0) = 0 we get C = 0 while from the condition y (x2 = 1) = 1 we get 
D =1/sinh(Vk) and hence the specific solution is: 


7 sinh (Vix) 
- sinh (vz) 


Note: this solution (with k = 1) is plotted later in the book (see Figures 69, 77 and 78). 
(b) We have F = xy — y” and hence from the Euler-Lagrange equation (Eq. 2) we have: 


- <= (-a/) = 0 
r+2y” = 0 
dy x 
dx2—« 
dy ; 


1.4 The Euler-Lagrange Equation 31 


3 
y = 7 +Czx+D (C and D are constants) 


Now, from the condition y (x; = 0) = 0 we get D = 0 while from the condition y (x2 = 1) = 12 we get 
C = 145/12 and hence the specific solution is: 


—a3 + 1454 
: 12 


(c) We have F = y’?/x?. Now, since F is independent of y we can use Eq. 4, that is: 


ra) y? 

mls] = ¢ 
Qy! 
wee 
dy _ Cy 
dx 2 
y= Sot+D 


Now, from the condition y (x; = 2) = 1 we get 2C + D = 1 while from the condition y (x2 = 4) = 31 
we get 32C'+ D = 31 and hence C' = 1 and D = -1. So, the specific solution is: 


a4 


Se 
y= 8 


Note: this solution is plotted later in the book (see Figures 71 and 76). 
d) We have F = sae ie and because it does not contain y we can use Eq. 4, that is: 
z y 


aly?) _ 
Oy! x 
y! 
St 
a /1lt+y? 
y2 = C22? Gis y?) 
ie _ C272 
DO SE Ote? 
fe ome he Cx 
H af Crge 
1 
y= FaVvil- Cx? +D 
1 
2 
g+(y—-D)y = ae 


Now, from the condition y («1 = 0) = V2 we get (v2 — D)° = @ while from the condition y (72 = 1) = 
1 we get 1+ (1—D)* = az and hence C = 1/V2 and D = 0. So, the specific solution is: 


ety =2 
(e) We have F = y — y? + ycoshz and hence from the Euler-Lagrange equation (Eq. 2) we have: 


a 
ay lv 


5, WW? — + yoosha! = ( (a i? + yoosha] = 0 
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2y + cosha 7 (2y’) 
dx 


—2y + cosha — 2y” 
1 
y+y- 5 cosh a 


I 


0 
0 


32 


So, the solution is obviously a combination of sinusoidal and hyperbolic functions (i.e. y = C cosa + 


Dsing + + cosh x) which can be checked by substitution into the Euler-Lagrange equation, that is: 


1 7 i 1 
[Ceose + Dsine + zon] + [Ceose + Dsine + 7 eosh| _ 5 cosh x 


1 : 1 ie 
[Caine + Deose + zsinna| + [Ceose + Dsine + 7 eosh| - 5 cosh 


1 1 1 
[Coos Dine + jooshe| + [Crcose + Dsine + josh | _ 5 cosh a 


0 


0 
0 


Now, from the condition y (x; = 0) = 0 we get C + + = 0 while from the condition y (x2 = 7/2) = 1 


we get D+ ; cosh $ = 1 and hence C = —+ and D=1-—- ; cosh 5- So, the specific solution is: 


1 1 1 
y= 7 os 4 (1 zoosh 5) sina {cosh 


Note: this solution is plotted later in the book (see Figure 72). 


(f) We have F = y’ + y? — 4ycosz and hence from the Euler-Lagrange equation (Eq. 2) we have: 


5 [y’? + y? — 4y cosa] < (= [y'* + y? — 4y cos ‘]) 


d 
2y — 4cosx — — (2y’) 
dx 


2y — 4 cosa — 2y" 
y" —y+2cosx 


0 
0 
0 


So, the solution is obviously a combination of hyperbolic and sinusoidal functions (i.e. y = C cosh a + 


D sinh xz + cos) which can be checked by substitution into the Euler-Lagrange equation, that is: 


[C cosh x + D sinh x + cos 2] es [C cosh x + D sinh x + cos x] + 2cos x 


[C sinh « + Dcosh x — sin |’ — [C cosha + Dsinhx + cosa] + 2cos x 


[C cosha + Dsinhx — cos] — [C cosha + Dsinhx + cosa] + 2cos x 


0 


Il 


Ils 


(ae = ee > a 


Now, from the condition y (x1 = 0) = 1 we get C+ 1 =1 while from the condition y (x2 = 7) = 1 we 


get Ccosha + Dsinha —1=1 and hence C = 0 and D = 2/sinh7z. So, the specific solution is: 


( ) 
y= - sinh x + cos x 
sinh 7+ 


Note: this solution is plotted later in the book (see Figure 80). 


(g) We have F = y’? — y? — 2zy and hence from the Euler-Lagrange equation (Eq. 2) we have: 


0 12 2 
By ly -y 


d 0 


0 
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13. 


d 
—2y — 2a — a (yo). =O 
—2y—2x4-2y” = 0 
y’+ty+x = 0 


So, the solution is obviously a combination of sinusoidal and polynomial functions (i.e. y= Ccosx + 
Dsin«x — x) which can be checked by substitution, that is: 


[Ccosa + Dsina — a]" + [Ccost+ Dsing—2])+2 = 


[ — Csina + Deosa — 1]'+ [C cosx + Dsinx — a] + 


T 
8 
I 
oo oe 


[-—Ccosx — Dsinz —0] + [Ccosz+Dsinz-—az] +a = 
0 = 0 


Now, from the condition y (x; = 0) = 1 we get C+ Dx 0—0 = 1 while from the condition y (x2 = 1) = 2 
we get C'cos1+ Dsin1 — 1 = 2 and hence C = 1 and D = (3—cos1)/sin1. So, the specific solution 


1s: 
=) ; 
y =cosx + 7 sinv— 2x 
sin 1 


Note: this solution is plotted later in the book (see Figures 70 and 79). 
(h) We have F = y? — xy and hence from the Euler-Lagrange equation (Eq. 2) we have: 


) d ( 0 
oy Pl ae G lw 2) ie 
d 
a a Po! = 
2y—2x 7a 10) 0 
Qy—2? = 0 
_@ 
a= ss 


Now, the first boundary condition y (x; = 0) = 0 is satisfied by this solution but the second boundary 
condition y (xz = 1) = 3 is not. Hence, this problem has no solution (i.e. specific solution for the given 
boundary conditions). 


Find the Euler-Lagrange equation that associates the functional I [6] = [ MG G - 56) dt (with B 


being a constant) and investigate its solution. 
Answer: We have F(t,,0) = 6? — 30? and hence the Euler-Lagrange equation (i.e. Eq. 2 noting that 
x,y,y’ correspond to t, @, 6) is: 


ws 90°] - 5 (5 [#-se]) = 0 
~280 — ‘ (26) 

—286 — 26 

6 + BO 


| 
ro) 


l 


Regarding its solution, we have three cases: 
(a) If 6 < 0 then we have 6 — |8|0 = 0 whose solution is obviously hyperbolic of the form 0 = 


acosh ( /| 8 t) + bsinh ( /\3| t) (with a and b being constants). 


(b) If 6 = 0 then we have 6 = 0 whose solution is obviously linear polynomial of the form @ = at + b 
(with a and b being constants). 


(c) If 6 > 0 then we have 6 + |8|@ = 0 whose solution is obviously sinusoidal of the form 6 = 


a COs (v vel t) + bsin (v |B t) (with a and b being constants). 
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1.5 Variational Problems with Higher Derivatives 


In some variational problems the functional integral J depends also on derivatives of the extremizing 
function y of higher orders. In such cases, more terms should be added to the Euler-Lagrange equation 
to account for this extra dependency. For example, if J depends on the second order derivative y” as well 
and hence: 


ius / Piao ae (11) 


[where the values of y and y’ at the end points are given by the conditions y(a1) = C1, y(a2) = Ca, 
y' (a1) = C3 and y' (xz) = C4] then the Euler-Lagrange equation!!! becomes: 


OF d (OF d-f-0F 
| - 12 
Oy dx (=) dx? (sr) : ) 


th 


This can be generalized if J depends on derivatives higher than the second (up to the n“” derivative) as 


well, that is: 


= Cl): ate (n) | v = 
Ily] = [ F (x.y 1Y ) dx and t S (—1) ; ( ) =0 (13) 


i=l 


where y, y(% ,y are the 1%, n*”, it” derivatives of y, ie. yO = dy/dax, y™ = d"y/dx” and yO = 
d'y/da". 
Problems 
1. What is the Euler-Lagrange equation when: 
(a) F(x, y,y',y") = yy" + ay”. 
(b) F (1,0,6,6) = 2sind + 62 + 36. 


(c) F(z,y,y',y") = ay” — y? + cry (with a and c being constants). 
(d) F(s,¢,¢',¢") = ¢'? +ad’ —bs*6+ cs? (with a,b,c being constants and the prime means d/ds). 
Answer: 


(a) Using Eq. 12 we have: 


) d {0 d? a) 
bat £ (fw earl) Sal fplarenn) = 
d d? 
" \ | = 
yo — Z (2ay') + Ta (y) 0 
y" (2y/’ 2ry"") | y" pee, 0 
Qy” ay Qy’ _ Qxy" = 0 
y'(l-a)-y' = 0 
(b) Using Eq. 12 (with x,y, y’,y” corresponding to t, 6, 6, 6) we have: 
a ee Ne a ee ee ce ao ae ae 
ap [2829 + 8 + 38] — — (= [2sing + 6 4 34] ) bes [2sing +6 +36]) = 0 
d /.. d? 
2eos) — = (26) +5 (3) = 0 
2cosd—20+0 = 0 
6—cosd = 0 


[21] Although we call this equation (and its alike) the Euler-Lagrange equation it is not really the Euler-Lagrange equation 
but an extension (or generalization) of it. In fact, in some texts this equation (and its alike) is labeled differently (e.g. 
Euler-Poisson) but we keep our labeling (i.e. Euler-Lagrange equation) for simplicity and to avoid possible confusion. 
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(c) Using Eq. 12 we have: 


a) 2 d (0 Be (ef O 5 
ee oe (a lay" 4 cxi)) + aaa \ gyn lau” a + eeu] ) = 0 
/ a? 
cE a (—2y’) + 7 (a) = 0 
ce +2y”"+0 = 
2y"+cr = 


(d) Using Eq. 12 (with x,y, y’,y” corresponding to s, ¢, ¢’, 6”) we have: 


0 
do 


[¢"? + ad! — bs”b + cs] d ( @ [87 + ad! — b5%0 +055] ) 


ds \ 0g! 
pee ( : [o"? + ag! bs?0 + 085] = 0 


ds? \ Ag" 
d a? 
2 " = 
Bar Mate AAO) oD 
—bs? -0+4+29¢ = 0 
264 —bs? = 0 


2. Obtain the Euler-Lagrange equations for the following functional integrals and solve them: 
(a) I[y] = Jor y (2—y") de. 
(b) I[y] = fF? (zy ty? — 2?y") da. 
(c) I [6] = Mi (26 492 — w?6?) dt (with w being constant). 
(d) I fy] = hi (y" +ay’? — B4) dx (with a and @ being constants). 
Answer: 
(a) In this case we have F (z,y,y',y”) = y(2—y”) and hence the Euler-Lagrange equation (i.e. Eq. 
12) is: 


0 ” d d) ” i d? 0 ” = 

a we—vl- = (gzbe-vl) +a (grwe-vl) = 0 
d d? 
e-v)-4@+5-y = 0 
a ee | 
1-y" = 0 
dy 

dz + 


On integrating the last equation twice we obtain the solution, that is: 


dy 
eel ee C 
as x+ 
1 
y = 5t +Cx+D 


(b) In this case we have F (x,y, y’,y”) = xy + y? — xy” and hence the Euler-Lagrange equation (i.e. 
Eq. 12) is: 


0 d {oO d? 6) 
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Boe (2y’) + ae (-2?) = 0 
x—2y”"-2 = 0 
dy 1 

ue 1 


On integrating the last equation twice we obtain the solution, that is: 


dy 1, 
ae eer 
ene ee 
y= =e? —--2°4+Car+D 


12 2 


(c) In this case we have F(t,0,6,0) = 26+ 6° —w?6? and hence the Euler-Lagrange equation (i.e. Eq. 
12 noting that x,y, y’,y” correspond to t, 6,0, 6) is: 


[20 +6? -a%6"] - 4 (2 [20+ 02 #)) as (5 [2+ 02 6) oe, 


Oe dt \ a6 dt? \ a6 
| oe a 
2 | _ 
299 — © (26) + =5 (2) = 0 
—Ww*9—264+0 = 0 
6 + w26 = 0 


So, the solution is 6 = C'cos (wt) + Dsin (wt) (with C and D being constants) as can be checked by 
substitution into the Euler-Lagrange equation, that is: 


2 


a [C cos (wt) + D sin (wt) ] + w?[C cos (wt) + D sin (wt) ] 


l| 
o 


£ [ — wC sin (wt) + wD cos (wt) ] + w?[C cos (wt) + Dsin (wt) | 25.6 
[—w?C cos (wt) — wD sin (wt)] + w? [C' cos (wt) + D sin (wt)] eg) 
0 = 0 


(d) In this case we have F (a,y,y’,y”) = y’ + ay” — 8% and hence the Euler-Lagrange equation (i.e. 
Eq. 12) is: 


afm 2 Yy d a mn 2 y d? 0 " 2 y 
Oy ly ee 8 dz \ Oy’ ly Cee 8 ' dx? \ dy” ly sey =, oe 
Bd eg a 
~F _ ony" +0 = 0 
B 
as = 
ES 2ax : 


So, the solution is y = & (clna — 2) + Ca + D (with C and D being constants) as can be checked 


by substitution into the Euler-Lagrange equation, that is: 


2a 


| F (m+? 1) +040] +52 Be 


[- Hwee 2) +Cx+ D| B = 0 
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3. Find the extremizing (or stationarizing) functions of the following functional integrals as well as the 
specific solutions for the oon boundary conditions: 


(a) I [y] = {2 (a?y" + y’? — y*) dex with y(m) = 2 and y (37) =5. 


(b) I [yl = fo (y? + 7202?y) de with y (0) =0, y(1) =0, y’ (0) =1 and y/ (1) =1. 

Answer: 

(a) We have F = x7y" + y’? — y? and hence from the Euler-Lagrange equation (Eq. 12) we get: 
0 2,/1 2 d 0 2,// , ,/2 2 ge ke 0 2,1 | ,/2 2 = 
gy ee te —¥)— Flay eu te - 01) + Ge ae vl) = 0 

d Ps 
—2y — oy! +2 = 
y’+y-1 = 0 


Hence, the extremizing function (which can be verified by substitution in the last equation) is 
y=Ccosx+ Dsing+1 


From the given boundary conditions (respectively), we get: 


Ccos7+ Dsinz7 +1 = 2 > C=-1 
Coos = + Dsin = 41 = 5 > D=-—4 


Therefore, the specific solution is: 


y =—cosx—4sing+1 


(b) We have F = y'” + 720x?y and hence from the Euler-Lagrange equation (Eq. 12) we get: 


) 9 d {0 2 d? 3) 2 3 
Bi [y* + 720x7y] — — = (a Dy! [y? + 7202 *il) a2 | aye [y”? +720x7y]} = 0 
d a? 
7202” — — (0) + = 0 
a — 7 (0) + 75 (2y") 
7202? +2y4 = 0 
y= 3602? 
On integrating 4 times we get the extremizing function: 
y= x t C3x° } Cox? + Cix+Co 
From the given boundary conditions (respectively), we get: 
0+0+0+0 Co = 0 =F Co =0 
14+C34+C.+C,;+0 = 0 — C3+C2+C,=1 
0+04+0+C, = 1 > Ci =1 
—64+3C3+2C2+1 = 1 > 3C3 + 2C2 =6 


Accordingly: Co = 0, C; = 1, Co = —6 and C3 = 6. Therefore, the specific solution is: 


y = —a° + 62? — 62? +2 
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1.6 Variational Problems with Multiple Independent Variables 


In some variational problems (i.e. multi-dimensional problems) the extremizing function y depends on 
more than one independent variable (unlike the previous problems where y depends solely on x). In such 
cases, the Euler-Lagrange equation is modified to account for the additional dependency. For example, if 
y = y(a1, #2) and hence:!??! 


Ily]= | F (21,22, Y; Yor) Yeo) x1 dre (14) 
Q 


(where y,, = Oy/Ox, and y,, = Oy/Ox_ and where y is identified on the boundary OQ of the domain 2Q 
of the integral) then the Euler-Lagrange equation!?*]_ becomes: 


OF 0 OF 0 OF 
= 1 
Oy Oxy (5,..) 0x2 (5) : oe 


It is very important to note that the notation in the formulation of variational problems with multiple 


independent variables is rather misleading. The reason is that: based on the derivation of Eq. 15 (and 
its generalizations) the partial derivatives with respect to the independent variables (i.e. ay and a5) 


apply to all the dependencies of their operands (i.e. Bi and Is) on the independent variables (i.e. 


x, and x2) whether these dependencies are explicit or implicit, and hence from this perspective they 
are like total derivatives. However, they are not notated as total derivatives because the expressions in 


these formulations have other dependencies and hence ro is not appropriate due to the existence of x2 


dependency (and similarly in is not appropriate due to the existence of x; dependency). In fact, some 


texts distinguish these partial derivatives by the notation Be and Bo and hence Eq. 15 is written as: 


OF D (OF D (OF 
— 1 
Oy Dx (5) Dax (5) : a 


The effect of this difference in the meaning of the notation will be seen in the solution of some Problems 
of this section. For example, in the solution of part (a) of Problem 2 we did not set a (yx) and Zo (yz) to 
zero (as we usually do in the formulations of the Euler-Lagrange equation where we consider only explicit 
dependencies), but we did (for the purpose of clarification) as follows: 


S (we) Sw) = 0 


O {Ody O (Oy a, & 
Ox \ Ox Oz \ Oz a 


ay Fy _ 4 
Ox? Oz? 
Yar — Vez =F 0 


As we see, the derivative 2. acts even on the implicit « dependency of y, (and similarly for # which acts 
even on the implicit z dependency of y-). 

Problems 

1. Write down the functional integral J and the Euler-Lagrange equation when the extremizing function 


y depends on n independent variables x1,--- ,2n. 
Answer: 


Tl = fof F (a1,-°- »Un Ys Ya45°°° ie) dx,++- dan 
Q 


[22] The reader should note that the functional in this case takes the form of a double integral on a given 2D domain Q. 

[23] Although we call this equation (and its alike) the Euler-Lagrange equation it is not really the Euler-Lagrange equation 
but an extension (or generalization) of it. In fact, in some texts this equation (and its alike) is labeled differently 
(e.g. Euler-Ostrogradsky) but we keep our labeling (i.e. Euler-Lagrange equation) for simplicity and to avoid possible 
confusion. 
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OF 


dent Variables 


n 


OF 


Oy 


where yz, = Oy/Oa;. 


0 
2, Ox; ( OY; 


) =0 


2. Find the Euler-Lagrange equations for the following functional integrals: 


(a) I ly] = fq (v2 — yz) da dz. 

(b) I ly] = fl (v2 + y2 + Cy) dt de. 
(c) I[y| = We (arys - az*y?) dx dz 
(d) T(E] = fg (2 + &] +. 0€) da dy 


Answer: 


(with a being a constant). 


[with a being a function of x and y, ie. a = a(x,y)]. 
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(a) Using Eq. 15 with F(x, z,y,Yx,yz) = y2 — y? (noting that 71, 22,Y,Yx,;Yx, in Eq. 15 correspond 


to ©, 2,Y, Ux, Yz here) we get: 


0 [0 


Of 0.25 


( OYe 


(2 -¥4]) 


( Dyn [vi #4)) 


6) ) 


2 (Yx) — 2 (yz) 


Oy O (Oy 
Ox Oz \ Oz 
Oy _ Oy 


Ox? 02? 
VYru a Yez 


Oz 


0 
Ox 


= 0 
= 0 


b) Using Eq. 15 with F(t,2,y,y,y2) = y? + y2 + Cy (noting that x1,22,Y,Ye,;Yx, in Eq. 15 
t x 1 2 


correspond to t,x, Y, Yt, Yx here) we get: 


Oty Bia Of Fp 8 0 ne eee = 
By Lyi + ¥2 + Cy] — 5 & [ye +¥2 + Cy] } — 5 Dyn [wi ty2+Cy]} = 0 
O ) 
C— — (24) — — (2y2 0 
© (244) — (Oye) 
6) O C 
OL (ye) + du (Yx) = 2 
O [Oy O (oy at SE 
Ot \ Ot Ox \ Ox ~ 2 
Oy O7y 2 °C 
at2 ' Ox2 —— 
which is a 2D Poisson equation. 
(c) Using Eq. 15 with F(z, z,y,Yx,yz) = 2?y2 — az7y? (noting that 71,272,y,Yr,,Yr, in Eq. 
correspond to x, 2,Y,Yx,Yz here) we get: 
O12 9 2,2 0 O 729 2,2 nee 2,2 
Oy [x Ya az yi | Ox OYe [z Ya az y? | Oz Oye [z Ya az ys i 0 
0 
0- an (227 ye) ae (—2az7y.) = 0 
—2 (22yx + ies) + 2a (2zyz +2 Yzz) 0 
2LYx £7 Yon + 2azyz az Yr —-/0 


15 


(d) Using Eq. 15 with F(x,y,€,€:,€)) = € + € + a€ (noting that 11,22,4,Yr,,Yey in Eq. 15 
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correspond to 2, y,&,&2,€y here) we get: 


6) ) 0 6) O 
xtra - 2 (letra +ot])-F (ge le+g+a]) = 0 
O ) 
a— a (22) — ay (244) = 0 
a — xe — UWyy = O 
fea + yy = 5 
OPE Or x, 
Ox? ' Oy2 2 


which is a 2D Poisson equation. 

3. Find the Euler-Lagrange equation for the functional I [y] = | In (y2 + y?) dx dz and suggest a solution. 
Answer: Using Eq. 15 with F(2,z,y,y2,yz) = y2+y? (noting that 71,72,Y,Ye,,Yr. in Eq. 15 
correspond to x, 2,Y,Yx,Yz here) we get: 


ar ree | ence can ee Sr 1) eemceaT (CR 
0 
0- = (ye) — 5 Que) = 0 
dO (Oy\ | O (dy ae Gi 
dz \Ox)] ' Oz\ Oz) — 
Ory dy 
be on = 


which is a 2D Laplace equation. One possible solution is y = xz because: 


oO” oF O 6) 


Note: it should be obvious that the independent variables x and z are independent of each other. 

4. The area of a simple surface in 3D Euclidean space with a given domain ( and a given closed boundary 
curve is to be optimized.|4l Find the Euler-Lagrange equation for this problem. 
Answer: Let the surface be given as z = z(x,y) over the domain 2 (in the zy plane) with a given 
closed boundary space curve T (where the projection of T on the xy plane is a simple plane curve 
OQ). So, the problem is a variational problem with two independent variables (i.e. x and y) and one 
dependent variable (i.e. z). Now, from elementary calculus we know that the area of such a surface is 


given by: 
a= ff \/l + 22 + 22 dx dy 


where z, = 0z/0x and z, = 0z/Oy. So, our functional integral is [[z] = o and hence F(x, y, Z, Zz, 2y) = 
\/1+ 224 22. Accordingly, the Euler-Lagrange equation for this Problem is (see Eq. 15 noting that 


L,Y; 2, Zn, Zy here correspond to £1, ©2,Y, Yx,,Yx. in Eq. 15): 


a Ct EG eee ee acs re ee 
eer 24 42 | 42 WN gases ame 2 2 = 
avi tet 3, (= nye 4) Oy (= i 22 #4) P 


[24] “Optimized” here should mean minimized since the area of such a surface can diverge. 
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Zets Be A Dope 28 Ba.) de a By (Qe eee Bee) =" 
jitete 2(it2422)” jit#+2 2(1+24+2)"" 
(= (1 + eee zy) 2a Bete t 2 | (2 (1 eo x) By (ZeZey + st | af AG 
(1 +224 2)? (1 22 2? (1 +224 2)? (1 +224 2)? 
(= + Zee Ze 4 Bugke ae “ste | _ (= + Zyy22 + Zain — 2y2yZny — Eu) Soh 
(1+ 224 22)°” (1+ 224 22)°” 
2na + 2aa2y — Za2y2ye , Zyy + 2yy2z— ZeryFay , 
Q+242)7 —  (14242)°? 
Zee + uke — zy 2yZay + Zyy + Coen mr 
(14+ 22+ 22)°” 
Zee + Pine, — 2Z_2yZny + Zyy + Zp te = 0 (17) 


We remark that in line 3 the partial differentiation with respect to x and y includes implicit as well as 
explicit dependencies on these variables (as explained in the text), while in line 7 we used zyx = Zny. 
Note: Eq. 17 defines the provision for minimal surfaces!?5] with the above given conditions (noting 
that some minimal surfaces may require a slight modification to the above conditions with regard to 
the boundary). 

5. Show that planes are minimal surfaces. 
Answer: A plane in 3D Euclidean space (defined over a given domain in the xy plane with a given 
boundary) is defined by the equation z = ax + by + c (with a,b,c being constants).!?61 Accordingly, 
Zee = Zey = Zyy = 0 and hence Eq. 17 is satisfied identically. So, the plane is a solution to Eq. 17 and 
hence its area is minimum (according to Problem 4), ie. it is a minimal surface, as required. 

6. Find the extremizing (or stationarizing) function of the following functional integral and suggest a 
specific solution that satisfies the given boundary conditions (as well as the given constraint): 


i= f | (22 — 22) dedy MO =waW=shaj=se SU #(0:5205) =4 


Answer:??71 Using Eq. 15 with F(2,y, 2,22, 2y) = 22 — z, (noting that 71,22, Y,Ye1,Ya2 in Eq. 15 


correspond to 2, y, 2, Zz, Zy here) we get: 


0 2 3 0 0 2 2 0 0 2, 2 = 
g'-l- a (a,@-4l)- g(a) = © 
) ) 
0 Da (2z) By | 22,) = 0 
Zax — Zyy = O 


This is the Euler-Lagrange equation of this Problem. We can suggest the following solution (which 
satisfies all the given boundary conditions and constraint as well as the Euler-Lagrange equation): 


z = sin (72) sin (zy) 


Note: this solution is plotted later in the book (see the lower frame of Figure 73). 


[25] Minimal surface is a surface whose area is minimum compared to the area of any other surface that shares the same 
boundary curve. Common examples of minimal surface are planes, catenoids, helicoids and ennepers. 

[26] The fact that some planes may be defined differently (e.g. y = 6) does not affect the generality of our assertion because 
with a simple transformation (which does not affect the geometrical properties) of the plane or the coordinate system 
(e.g. rotation) the plane can be defined by an equation of the above form. 

[27] In this answer (as well as in the answers of the following Problems in this section), we omit many details and possibilities 
to avoid exceeding our space and purpose. 
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Figure 2: The solution z = sin (72) sinh (ry) of Problem 7 of § 1.6. 


7. Find the extremizing (or stationarizing) function of the following functional integral and suggest a 
specific solution that satisfies the given boundary conditions and plot the solution: 


I[z]= [ [ (22 + 22) da dy z(0,y) = z(#,0) = z(1,y) =0 z (a, 1) = sin (ra) sinh (77) 


Answer: Using Eq. 15 with F(a,y,2, 2,2) = 22 + 24 (noting that 71, 22,4,Ye,,Ye2 in Eq. 15 
correspond to x, y, 2,22, Zy here) we get: 


Oo, 02 Of O72 2 0 (97. 12 = 
De [224 Z| an ( Dz, [24 Z| Oy oe, [e+ x = 0 
0 O 
0- an (2z2) — a (2zy) = 0 
Zag + Zyy = O 


This is the Euler-Lagrange equation of this Problem. We can suggest the following solution (which 
satisfies all the given boundary conditions as well as the Euler-Lagrange equation): 


z = sin (7a) sinh (ry) 


This solution is plotted in Figure 2. 
8. Find the extremizing (or stationarizing) function of the following functional integral and suggest a 
specific solution that satisfies the given boundary conditions: 


1 pl 2 
id=] i (2242-02? + =) dx dy z(0,y) =z(#,0)=z(l,y)=0 = z(a,1) =sin (72) 
0 JO y 


Answer: Using Eq. 15 with F(z, y, 2, 2x, 2y) = 22 + 27 — 72? + Ae (noting that 21,22, Y,Ye1, Ye. mM 
Eq. 15 correspond to 2, y, Z, Zz, Zy here) we get: 


rs) 222 6) 6) 22? 
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10. 


0 0 2 227 
al |4, +2, -—We2 + = 0 
Ay \ Azy 
4 0 
nz + — 5 (222) 5 (22,5), =. 0 
Qn*z+ 22ne — 2Zyy = O 
2 
Zee + Zyy +772 a - 0 


This is the Euler-Lagrange equation of this Problem. We can suggest the following solution (which 
satisfies all the given boundary conditions as well as the Euler-Lagrange equation): 


z=y’ sin (rz) 


Note: this solution is plotted later in the book (see the lower frame of Figure 74). 


. Find the extremizing (or stationarizing) function of the following functional integral and suggest a 


specific solution that satisfies the given boundary conditions and plot the solution: 


me. ze +z; —4z) dx dy 


z(0,y) =0 z(x,0) = z(ly)=y-1 z(e,l1)=a-—2° 


Answer: Using Eq. 15 with F(z, y, z, Zz, - = 224+ ee — 4z (noting that %1,%2,Y,Ye,;Yr, in Eq. 15 
correspond to 2, y, 2, Zz, Zy here) we get: 


a eee 0 O p22 a Opa. 2 = 
3z lee + Zy 42] Da (2 Ee + Zy 4z] Dy Oe [zz + 2y — 42] = 0 
(7) () 
4 an (2zz) Dy (2z,) = 0 
Zag t2yyt2 = 0 


This is the Euler-Lagrange equation of this Problem. We can suggest the following solution (which 
satisfies all the given boundary conditions as well as the Euler-Lagrange equation): 


z=ay—x" 
This solution is plotted in Figure 3. 
Find the extremizing (or stationarizing) function of the following functional integral and suggest a 
specific solution that satisfies the given boundary conditions: 


= ff teh) aeay z(0,y) = z(2,0) =z(1,y) =2(z,1) =0 


Answer: The functional integral of this Problem is the same as the functional integral of Problem 9 
and hence the Euler-Lagrange equation for this Problem is the same as for Problem 9. We can suggest 
the following series solution: 


amen! aS 5 ! a i Toa sin (maa) sin (ny) 


mn ( a5 +n?) 


n=1m=1 


Note: this solution is plotted (up to and including m = n = 7) later in the book (see the lower frame 
of Figure 75). 


1.7 Variational Problems with Multiple Dependent Variables 44 


y x 


Figure 3: The solution z = xy — 2? of Problem 9 of § 1.6. 


1.7 Variational Problems with Multiple Dependent Variables 


In some variational problems the functional J depends on more than one dependent variable (i.e. extrem- 
izing function). For example, J may depend on two extremizing functions y; = y1 (x) and y2 = y2 (x) and 
hence I (which is originally given by Eq. 1) becomes: 


z2 
I lyr, ye] =| F (a, Y1,Y25 Vi 99) dx (18) 


1 
where J is the functional integral whose optimization depends on the extremizing functions y; and yo, 
the prime stands for d/dz, and the functional is constrained by four boundary conditions: y; (a1) = C1, 
yi (a2) = Co, ye (a1) = D1, yo (a2) = Dg (with Cy, C2, D,, D2 being constants). In this case there will be 
one Euler-Lagrange equation (see Eq. 2) for each extremizing function, that is: 


OF d (OF 
a eee ee fe = 1 
Oy, da & : ae 
OF d (OF 
a if 2 = 2 
Oy2 dx (sr) : ve 


where these equations should be solved simultaneously!?8! to obtain the solution of the variational prob- 
lem. The above formulation can be easily generalized when the functional depends on more than two 
extremizing functions (see Problem 1; also see § 6). 


Problems 


1. Write down the functional integral J and the Euler-Lagrange equations when J depends on n dependent 
variables yi(@),-++ , Yn(x). 
Answer: 


v2 
Ty uel =f F(z, Y1)** Yar Vist? Yn) ae 
Ly 


[28] Simultaneous here does not necessarily mean they are linked as a system. 
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2. Obtain the Euler-Lagrange equations for the following functional integrals (of multiple dependent 
ree 


(a) I eg + az” + 3bay — cz) dx (with a, b,c being constants). 
(b) I [2 (r? +Cr-1 +r?) ds. 
(c) I ae se (4? + 43) dt. 
Answer: 
(a) We have F (x,y, z,y’, 2’) = y’3 + az”? + 3bry — cz and hence the Euler-Lagrange equation for y is: 
3) d {0 
By ly’? + az!* + 3bary cz] de & [y’? + az’? + 3bary «) = 0 
d 
3ba — — (3y”) = 0 
w — = (3y") 
3ba — 6y'y"” = 0 
6y'y" —3b2 = 0 
while the Euler-Lagrange equation for z is: 
a) d {0 
De [y’? + az”? + 8bzy — cz] i & [y’? + az”? + 8bry — ce) = 0 
—c— — (2az’) = 0 
—c—-2az"” = 
Qaz" +e = 


(b) We have F'(s,r,¢,7’,¢’) =r? + Cr-! + r?7¢” and hence the Euler-Lagrange equation for r is: 


oO [r’? + Cro? rp” d ( 0 [r’? or +126] = Q 
ir s \ Or’ 
Cr-? + 2rd” d (2r’) = 0 
ds 
Cr-? + 2rd -—2r" = 0 
Qn" + Cr-?7-2r¢? = 0 
while the Euler-Lagrange equation for ¢ is: 

6) = d 3) = 
a [r+ Cr 14 72g] - (= ae gag ae rg?) a G 
0-2 (re) = 0 
— (4rr'¢! + 2r°¢"") = 0 
arr'd' +r76" = 0 
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% = 0 
Similarly, the Euler-Lagrange equation for x2 is #2 = 0. So, the Euler-Lagrange equations are: 
x, =0 (i = 1,2) 


. Find the extremizing (or stationarizing) functions of the following functional integrals as well as the 
specific solutions for the given boundary conditions: 
(a) Ty, ye] = fo (y2 +9? + + 4y2) de with y: (0) = 1, y: (1) = 9, y2 (0) = 1, yo (1) = 0. 

m/2 ci 
(b) I [y1, ye] = aa (yi? —yityS+y3) dx — with y (0) =1, y: (#/2) = 1, yo (0) = 1, yo (7/2) = 1. 
Answer: 
(a) We have F = y? + y7 + y? + 4ye and hence from the Euler-Lagrange equations (Eqs. 19 and 20) 
we get: 


O 12 2 12 d a) 12 2 12 
ze + 4 A = 
Dan [yy + yi t+ uF + 4y2| dn \Oyh [yy + ui t+ uF + 4y2| 0 
d 
Die Oye = 
Y1 ae ( Yi) 0 
2yi—2yf = 
// 
Yi SYle SS 
) 12 2 12 d ) 12 2 12 
bye ty 44 4 = 
a [yy + yi t uy + 4y2| ae Op [yy + ui t+ uF + 4y2| 0 
d 
A= — (0) = 
4—2y5 = 
Yo = 


Hence, y; = C, cosha + D, sinha and yy = x? + Cox + Do. 
From the given boundary conditions (respectively), we get: 


C,cosh0+ D;sinhO = 1 = Cy =1 
C,cosh1+ D,;sinhl = 0 > D, =—coth1 
0+0+Dz. = 1 > Dz=1 
14+€,;+Dz = 0 > Cy = -2 


Therefore, the specific solution is: 
yi = cosha — (coth 1) sinh x ye =o? — Qa +1 


(b) We have F = y/? — y7 + y¥ + y2 and hence from the Euler-Lagrange equations (Eqs. 19 and 20) 
we get: 


a d {0 
Dy, Wt ut tue +98) = (a [ur it +ue +ui)) = 0 
d 
265s os =. |G 
Y1 dg (eu) 
—2y,-2yf = 0 
yit+y = 0 
a d {0 
Dyy Wt vt tae + 9] oe (ss [yi it +uf + ul) = 0 
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d 
2y2 — ae (2y5) = 0 
2y2 — Qys = 0 
yo-yo = 0 


Hence, y; = Ci cosx + D, sina and y2 = C2 coshaz + D2 sinh. 
From the given boundary conditions (respectively), we get: 


C,cos0+D,sin0 = 1 > Cy =1 
Cycos + + Di sin = 1 = Des 
Cycosh0O+ DesinhO = 1 s Cy =1 
C> cosh = + Do sinh = 1 = D h— — coth = 
Cc _ —_— —= = — —co —_— 
2 COS 5) 2 S51 5) 2 CSC. 5) Cc 5) 
Therefore, the specific solution is: 
2 Tv T\ 
y1 =cosx+sina yo = cosh x + (csch$ — coth =) sinh x 


1.8 Variational Problems with Constraints 


In some variational problems certain constraints are imposed on the sought solution and this requires 
embedding these constraints as additional conditions within the variational formulation using a certain 
technique. However, before we talk about this type of variational problems in the calculus of variations we 
need to talk briefly about the Lagrange multipliers technique in general which is usually used in dealing 
with this type of variational problems in the calculus of variations. 

The Lagrange multipliers technique is an analytical method used in the search for extremums of functions 
when certain constraints are imposed on these functions. The theoretical basis of this technique is that 
a function f(x) on which a given constraint g(x) = c (with c being a constant) is imposed has an 
extremum at a point P iff the function f + Ag has an extremum at P (where the parameter \ is called 
the Lagrange multiplier).?9! So, instead of extremizing f in our search we extremize h = f + Ag (so 
that the constraint is included in the formulation). More than one constraint can be embedded using 
more than one Lagrange multiplier (i.e. one Lagrange multiplier for each constraint). For example, if a 
function f is to be extremized subject to three constraints gi, 92,93 (i.e. g1 = C1, g2 = C2, and g3 = Cs) 
then the extremization formulation (according to the technique of Lagrange multipliers) will take the form 
f +Aigi + A2g2 + A3g3 (with A1,A2,A3 being the Lagrange multipliers), and hence we should extremize 
h= f +Argi + A292 + Azsg3 (instead of f) in our search. 

Now, returning to the issue of the variational problems with constraints in the calculus of variations it 
should be obvious that the Lagrange multipliers technique as described above (with some simple adapta- 
tions) can be used to deal with this sort of problems in the calculus of variations. Accordingly, we can 
say: a functional , = i F dz on which a given constraint Ip = Jee G dx = C (with C being a constant) 
is imposed is extremized by the function y(x) iff the functional I[y] = I, + Aly is extremized by the 
function y(x). So, instead of extremizing J, in our search we extremize J, and hence the functional of Eq. 
1 becomes: 


iy] = i F(eyy) de+a f Cowie 


1 


[ [Feu +ree@uy)] a 


1: 


[29] In fact, “extremum” in the above statement represents the issue of interest in the calculus (or mathematics) of variations 
(otherwise “stationary” is more inclusive). Also, whether we should use if or iff in the above statement depends in our 
opinion on how we view the problem (i.e. whether it is a variational problem or a constrained variational problem). 
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x2 
= [Huy ae (21) 
1 
where H = F + AG. Accordingly, the Euler-Lagrange equation (see Eq. 2) becomes: 


oH a (Fr) =(0 (22) 
Oy dz \ Oy’ 

Again, more than one constraint can be embedded using more than one Lagrange multiplier. Therefore, 

if we have n constraints then the variational formulation will take the form F’' + \,G, +---+AnGn, and 

hence we use the integrand H = F + A,G,+--:+AnG, (instead of F’) in our functional integral with the 

employment of Eq. 22 (instead of Eq. 2). 

Regarding the solution of this type of problems, we note that the Euler-Lagrange equation is a second 
order differential equation and hence the solution y usually contains two constants of integration. More- 
over, it contains n Lagrange multipliers \’s (where n is the number of the given constraints). Hence, the 
obtained solution usually contains n + 2 unknowns. Accordingly, to obtain a specific solution we should 
use the two boundary conditions at the end points plus the n constraining conditions ee G,dzx = C; 


(i =1,--- ,n). This will be applied and clarified in some of the upcoming Problems.!°°! 

We should finally note that although the Lagrange multiplier(s) \ enter in the formulation of variational 
problems with constraints, in many cases the determination of A is irrelevant to the required solution (even 
though the determination of \ may be needed provisionally in some of these cases).!°4!_ Accordingly, » 
(or A’s) is just a tool to obtain the solution and hence the reader should not worry about 2 and its 
value or nature if the sought solution is obtained. Yes, there are some types of problems in which » 
(or 4’s) has certain mathematical or physical significance and hence the determination of > may be 
desirable or even indispensable. These issues will be seen (and investigated in practical terms) in the 
upcoming Problems which are solved by this technique (or they are linked to this technique). We should 
also note that the Lagrange multipliers technique does not necessarily lead to extremization (since it is 
essentially a stationarization technique which is more general) although we were generally talking above 
about extremization (which is the main purpose of the variational techniques). So, further investigation 
to determine the nature of the obtained solution (i.e. minimum or maximum or inflection or saddle) may 
be required. 


Problems 
1. Justify the logic of the Lagrange multipliers technique using a simple argument. 
Answer: Starting from Eq. 21 we have: 
x2 x2 x2 2 
ea Hae = | (F + AG) ax = f Pac +d Gdz =I, + Alz 
Ly Ly xy 1 
So, if J; is extremum (or stationary) and Jz is constant (according to the constraint) then I should 
also be extremum (or stationary). 
2. Outline the procedure of the Lagrange multipliers technique in solving constrained variational prob- 
lems. 
Answer:!*?! If f(a) is a function to be optimized (or stationarized) subject to a constraint g(a) = ¢ 
(with c being a constant), then according to the procedure of the Lagrange multipliers technique we 


do the following: 
e We define a new function h = f + Ag where X is an extremizing (or stationarizing) parameter. The 


[30] As indicated in the phrasing of this paragraph, we are talking about a typical problem of this type; otherwise some 
problems may not strictly follow the above description and procedure. 

[311 In fact, this is behind the “Lagrange undetermined multiplier” that is used to label \ in some texts. 

[32] This answer is rather generic and lacks rigorous technicalities. The purpose of it is to give a general idea about the 
Lagrange multipliers technique rather than a rigorous treatment and formulation. However, the technique will be more 
clarified by the many upcoming constrained variational Problems in this book which are formulated and solved by this 
technique. 
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function h is commonly known as the Lagrangian while the parameter \ is commonly known as the 
Lagrange multiplier (or Lagrange undetermined multiplier). 

e We optimize h by taking its derivative with respect to x and equating the derivative to zero. The 
solution of this equation will yield the optimal point(s). 

e If we have more than one constraint (say n constraints g; = C1,:-: ,9n = Cn) then h is defined as 
h= f+ Aig. +-+:+Angn and the above procedure (of taking derivative and equating it to zero) is 
repeated. 

e If f and g (or g’s) are multi-variable functions!**! (say they are functions of x1,--- ,@m) then we 
take the partial derivatives of h with respect to these variables and equate the derivatives to zero 


(ie. ge =0,---, eh. = 0). The solution of this set of simultaneous equations will yield the optimal 
point(s). 


e In the case of optimizing a functional (rather than a function) which is usually dealt with by the 
techniques of the calculus of variations, the above procedure is amended to cope with this altered 
situation. So, in the calculus of variations (where the functional is an integral) the function H (which 
is the integrand of the functional integral where H = F + AG or H = F+21G, +--+ AnGp) is 
used as an input to the Euler-Lagrange equation (see Eq. 22) whose solution will yield the optimal 
function(s). Although this procedure looks rather different from the above-described procedure of the 
Lagrange multipliers technique, it essentially rests on the same logic and rationale. 

3. We have a 3 meter rope which we want to shape into a rectangle such that the enclosed area is optimal 
(i.e. maximum). Find the dimensions of the required rectangle. 
Answer: This is a constrained variational problem where we want to optimize a function (which is 
the area o of the rectangle) subject to the constraint that the perimeter p of the rectangle is equal 
to 3. Now, if the lengths of the two sides of the rectangle are x and y then f (x,y) = o = zy and 
g(x,y) =p=2(a+y) =3 and hence h = ry + A2 (a + y). Accordingly: 


Oh 


— = 2= 
Aa y+ 0 
Oh 

ss A2 = 0 
Oy sda 


On subtracting the second equation from the first we get y— x = 0 and hence y = x which means that 
our rectangle is a square of side length « = 3/4. 
Note: we may use a single variable approach for solving this Problem where f (1) =o = 2#(1.5-—2) = 
1.52 — 2? and g(x) = p = C = 3 (where C is a constant) and hence h = f + Ag = 1.5¢ — 2? + AC. 
Accordingly: 

dh 


—=15-%2% = 
ae 5 xz = 0 


which leads to the same solution, i.e. 2 = 3/4. However, this in reality is a non-constrained approach, 
i.e. the constraint is actually embedded in the formulation of f rather than being imposed as an 
additional condition in the formulation. 

4. What is the Lagrange multipliers formulation for a functional [ F' that to be optimized subject to n 
constraints [ G,--- , [ Gn. 
Answer: The formulation is given by Eqs. 21 and 22 with H being given by: 


H=F+S XG; 
i=1 
5. Find the Euler-Lagrange equations for the following constrained variational problems: 
(a) F(z,y,y') = cy”? with G = x7y?. 
(b) F(z, y,y') = yy? with G = zy. 


[33] We note that the Lagrange multipliers technique is generally associated with multi-variable functions (rather than single- 
variable functions). In fact, even when f is originally a single-variable function the Lagrange multipliers technique usually 
leads to multi-variable formulation. 
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(c) F(a, y,y') = y/y’ with G = az? + b (a and b are constants). 
(d) F(t, 2,4) =x with G = az? (a is a constant). 


Answer: 

(a) We use Eq. 22 with H=F + \G = xy”? + Ax?y? and hence the Euler-Lagrange equation is: 
) d { 0 
since £ (Bian) = 


2r27y — “ (2axy’) 
x 


| 
o 


l 


Qra97y — 2y’ — 2ay" 
ay +y' —Ax*y = 
(b) We use Eq. 22 with H = F + AG = yy’? + Avy and hence the Euler-Lagrange equation is: 


a) 
By [yy’? + Avy] : ( pil [w/? + Azy] = 0 
y? 4 hr —- — (3yy7) = 0 
y? + \x — 3y’8 = 6yy!y"” = 0 
6yy'y” + 2y%—rAx = O 
(c) We use Eq. 22 with H=F+AG "a + \ax? + Xb and hence the Euler-Lagrange equation is: 
6) y 2 
tae + Ab 7 + Aax® + Ab = 0 
Oy y 
1 d y 
ale) = 
1 y’ Qyy”’ 
yt ye ys = 
2  2yy"’ 
yoy - ° 
yy SE 0 
13 po 
y ¥ 


(d) We use Eq. 22 with H = F+AG = x+ az? and hence the Euler-Lagrange equation (noting that 
x,y,y’ in Eq. 22 correspond to t,x, here) is: 


2 [x + rak?| _ “ (F [x + Aaé*]) = 0 
d 
1- Fr (Q\az) = 0 
1-—2\az = O 
1 
exe Sine, j= 10) 
2a 


6. Find and solve the Euler-Lagrange equations for the following constrained variational problems: 
(a) F(z,y,y') =y and G= 1/1 + y!2, 
(b) F(z, y,y') > yl? and G => xr. 
Answer: 
a) We use Eq. 22 with H=F+AG=y+AvV/14+y?”, that is: 
y y 


5 t Ay/1 4 y?| . (a ly avi+y"]) = 0 
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af \ _y 
dx 1+y? -_ 


This is the Euler-Lagrange equation which we solve as follows: 


Ay! 7 C 
Jive 
My? = (#+C) (1+y") 
y’? _ (x +C)? 
Ya (e+ 6) 
fe a) 
VX -(@+0? 
y = F/d-(e@+C)?+D 
(e+C)+(y—D)? = » 


(b) H= F+AG = y'1/2 + Xx? which is independent of y and hence we can use Eq. 4 (with H replacing 
F), that is: 


) 
ye eee 
1 
Qy!1/2 


This is the Euler-Lagrange equation which we solve as follows: 


De ae 
u 402 
x 
= — ,+D 
‘ 407 * 
7. Find and solve the Euler-Lagrange equations for the following constrained variational problems (subject 

to the given boundary conditions and constraints): 
(a) F(z,y,y') =y? — y? and G=y with y(« = 0) =0, y(w@ = 7) = 1 and a Gdzxr=1. 
(b) F(z, y,y') = y? and G = —y? with y(x = 0) = 0 and y/(4 = 1) = 0. 


Answer: 
(a) We use Eq. 22 with H=F+AG=y? — y? + Ay, that is: 
) d { 0 
sip rent-a(e nore) «8 
dios 
—2y+A- aa (2) = 0 
—2y+A-—2y” = 0 
BS a A Po 
y y 5) 


So, the solution is y = C cosa + Dsina + a which can be checked by substitution, that is: 


A 
r rN | 
Coos + Dsina + 5) + [Coos + Dina + =5 - 9 
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‘ ' : mr vA ? 
[ -— Csina + Dcosz + 0] + Coosz + Dsinz + 5 Set 0 
¢ . mr vA 2 
[ — Ccosa — Dsina] + Coosx + Dsina + 5 a tee 0 
0 = 0 
0 we get C+ 4 = 0 and hence C' = 3 while from the condition 


Now, from the condition y (a = 0) = 
y(« = m) = 1 we get —3(—1) + 0+ 4 = 1 and hence A = 1. Thus, the solution becomes y = 
—} cosxz + Dsinz + 5. Also, from the constraint we get: 


eo 1 1 
| (—5eose+Dsine +5) dx = 1 
a 2 
| Ceose + 2Dsine +1) de =y 2 
0 
[-sinz —2Dcosr+a2]h = 2 
[-0 —-2D(-1) + 7]-—[-0-2D+0] = 2 
2D4+7+2D = 2 
4D+n7 = 2 
D= 2-7 
4 
Therefore, the solution is: 

Das + cil si 43 
=—=cosx —— }sinx+-—- 
aa 4 2 

(b) We use Eq. 22 with H = F+ AG = y’? — Ay’, that is: 
O +p 5 d {0 
r 2 _dy?]) = 0 
ay ly” — 0] — ae | ay le? — A] 
d 
y= ey yk & 
Ay — a (2y’) 0 
y’ +rAy = 0 


Now: 
If \ < 0 then y = acosh ( Ale) + bsinh ( le) [with a,b being constants] and from the first 


boundary condition we have 0 = acosh(0) + bsinh(0) [and hence a = 0] while from the second 
boundary condition we have 0 = b,/|A| cosh (Vil) [and hence b = 0 because cosh (Vil) = 0 has 


no real solution] . So, the solution is y = 0. 

If \ = 0 then y = az + b (with a,b being constants) and from the first boundary condition we have 
0 = a0+b (and hence b = 0) while from the second boundary condition we have 0 = a. So, the solution 
is y = 0.134) 

If \ > 0 then y = acos (Vr) + bsin (vie) [with a, b being constants] and from the first boundary 
condition we have 0 = acos (0) +6sin (0) [and hence a = 0] while from the second boundary condition 
we have 0 = bVXcos (vr) [and hence if we assume non-trivial solution, ie. b 4 0, then V\ = 77/2]. 


So, the solution is y = bsin (3). 


[34] We consider the case \ = 0 for the sake of comprehensiveness (considering more general situations); otherwise the 
multiplier \ is not supposed to be zero. 
[35] Tp fact, this is the principal solution; otherwise VA can be an odd multiple of m/2. 
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8. Obtain a specific solution for part (b) of Problem 7 assuming y > 0 plus the following constraint 
condition: ifs Gdz =—n. 
Answer: From the given constraint we get: 


[ —b* sin” (=) dz = -T 
2 1 
2s E = sin (nx) = -7 
2 ) 
b2 
—-=(1-0-0+0) = -7 
b — rv 2 


Therefore, the specific solution (taking the positive root only since y > 0) is: 
y = V2nsin (™) (0<a<1) 


9. Find the Euler-Lagrange equations for the following constrained variational problems: 
(a) F(a, y,y') = 27y” with G, = zy and G2 = yyy. 
(b) F(z, y,y’) = y"/? with G, = y? and Gz = ax? (a is constant). 
Answer: 
(a) We use Eq. 22 with H = F+21G14 2G = 27y? + \ryt+Aayvy' and hence the Euler-Lagrange 
equation is: 


5 [xy + Auwy + vy] - ‘ (5 Ex + yey + dvi) iG 
Mae + AoV/y! — < (20° + ost) = 0 

Ma 4 doy! Ary’ — 2a?7y" ta agit = 0 

die -+ doy — dary! — 202g!" * Sg = _ 9 

Ayu + ay —4Agy! — Qa? y" + ae = 0 


(b) We use Eq. 22 with H = F + 1G, + A2G_ = y'"/? + \yy? + Apax* and hence the Euler-Lagrange 
equation is: 


+. yi? + Ay” 4 Azan] = < (= [v2 + Ay? + doar ) a 
2d.y + wins = 0 
y" + 8rAyy?/? = 0 


1.9 Variational Problems with Variable Boundaries 


So far, the variational problems that we dealt with have fixed boundaries. In some variational problems, 
the boundary is partially or totally variable. In fact, there are many cases and details in this regard. 
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So, in this section we briefly investigate only some simple cases of variational problems with variable 
boundaries. 3° 

The simplest of these cases is when the variational problems (of single variable) have only one fixed end 
point and hence the other end point is variable. In this case the Euler-Lagrange equation (i.e. Eq. 2) will 
be used as before but with a modification to the boundary conditions where a transversality condition 
will be imposed on the variable end point (while the fixed end point keeps its fixed boundary condition as 
before). To be more specific, let have a curve [ represented by a function yp = y(x) and it connects a fixed 
point A to acurve ; where the end point B of T is restricted to move freely on I’; (as depicted in Figure 4). 
In this case, the transversality condition states: if the value of the functional I [yr] = (oy F (a,y,y') dx is 
optimal (with respect to neighboring curves connecting point A to curve [';) then at point B the direction 
dX : dY of Ty and the element of I should satisfy the relation: 


Figure 4: A schematic illustration of the setting of variational problems with variable boundaries in the 
simplest case when a curve [ (that is to be optimized) connects a fixed point A to a given boundary curve 
T, where the end point B of T is restricted to move freely on Ty. See § 1.9. 


F dX + (dY —y'dX) Fy =0 (at point B) (23) 


where y’ belongs to Tat B while dX and dY belong to Ty at B, and Fy is the partial derivative of F 
with respect to y’. 

Another case is when a curve [ represented by a function yr = y(x) is connecting two other curves (Ty 
and T'2) where the end points (B; and Bg) of T are restricted to move freely on Ty and [2 (as depicted in 
Figure 5). In this case, the transversality condition states: if the value of the functional I [yr] is optimal 
(with respect to neighboring curves connecting T, and Iz) then at the points B; and Bg the directions 
dX, :dY, of Ty and dX2 : dY2 of [2 and the corresponding elements of I should satisfy the following two 
relations: 


FdX, ale. (dY, — y dX,) Py = 0 (at point B,) (24) 
F dX2 + (dY2 —y'dX2) Fy = 0 (at point Bo) (25) 


where again y’ belongs to I at B; and Bz while dX, and dY, belong to T'; at By and dX» and dY2 belong 
to T> at Bo. 


[36] Variable boundaries may also be called free boundaries because free movement within certain restrictions is allowed. 
However, in our view “variable” is more appropriate than “free” because it is not really free. 
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Figure 5: A schematic illustration of the setting of variational problems with variable boundaries in the 
case when a curve I (that is to be optimized) connects two given boundary curves (I; and ['2) where the 
end points (B, and Bz) of [ are restricted to move freely on Ty and Ty. See § 1.9. 


Finally, we should draw the attention to the following important remarks: 
e Although in many cases the transversality conditions lead to the perpendicularity of the extremal curve 
and the boundary curve (ie. T 1 Ty or TF L Tg), this is not necessarily the case and hence in the 
variational problems with variable boundaries the transversality conditions may not lead to orthogonality. 
e To avoid potential confusion, the reader should note that the transversality condition may be given in 
some texts by other forms like the following: 


(F —y' Fy) dX + FydY =0 (26) 


which is identical to Eq. 23 but with different ordering and grouping of terms. The transversality equation 
may also be “divided” by dX and hence Eq. 23 (as well as Eqs. 24 and 25) becomes F'+ (Y’ — y’) Fy =0 
where Y’ = dY/dX (and similarly Eq. 26 becomes F — y'Fy + Fy Y’ = 0). 
Problems 
1. Solve the following variational problems (in which we have one fixed boundary and one variable bound- 
ary): 
(a) F(z,y,y’) = y? + 2yy’ — y? with fixed boundary y(0) = 1 and variable boundary x = 7/4. 
(b) F(z,y,y') = y’ — xy’ with fixed boundary y(0) = 5 and variable boundary x = 3. 
(c) F(z,y,y’) = /1+ y/y with fixed boundary y(0) = 0 and variable boundary y = x + 2. 
Answer: 
(a) In this case we have one point of the curve fixed by the condition y(0) = 1 while the other point 
of the curve is free to move on the vertical line x = 7/4. In part (j) of Problem 11 of § 1.4 we solved 
this Problem (without boundary conditions) and obtained the solution y = asina+bcosx. Now, from 
the fixed boundary we get b = 1 and hence y = asinx + cosx. Regarding the variable boundary, we 
impose the transversality condition: 


FdX +(dY —y'dX) Fy =0 


Now, since the boundary curve x = 7/4 is a vertical line then dX = 0 (noting that in the transversality 
condition y’ belongs to the extremal curve while dX and dY belong to the boundary curve) and hence 
the transversality condition becomes dY Fy = 0. If we now note that on a vertical line dY # 0 then 
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the transversality condition will be reduced to: 


Py = 0 

Qy'+2y = 0 

y+y = 0 

(acosx —sinx)+(asinz+cosz) = 0 
(a+1)cosx+(a-—1)sinz = 0 

at+l1 


l-a 
at+l1 


=a = 1 (a = 1/4) 
a = 0 


So, the solution is y = cos x (which is plotted in Figure 6). The two curves meet at point (4, Z): 


0.8 
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X 


CN i 


Figure 6: Plot of the extremal curve y = cos x (solid) and the boundary line x = 7/4 (dashed). See part 
(a) of Problem 1 of § 1.9. 


(b) We have F = y’? — zy’ and hence the Euler-Lagrange equation (noting that y is missing in F and 
hence we use Eq. 4) is: 
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On integrating this equation once we get: 


oO 
=—+— D 
Yy q oT 5 r+ 
Now, from the fixed boundary y(0) = 5 we get D = 5 and hence y = ae + $x+5. Regarding the 
variable boundary, we follow similar procedures and arguments to those of part (a) of this Problem 
and hence we get the following transversality condition: 


Fy 0 
2y’-a = 0 
, _ «£ 
Me tS Se 
rc C  & 
272 > 2 
C = 0 


Accordingly, the solution is y = = +5 (which is plotted in Figure 7). The two curves meet at point 
(3, 7.25). 


(Oe 
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Figure 7: Plot of the extremal curve y = ae + 5 (solid) and the boundary line x = 3 (dashed). See part 
(b) of Problem 1 of § 1.9. 


(c) We have F = \/1+ y’/y and hence the Euler-Lagrange equation (noting that x is missing in F’ 
and hence we use Eq. 3) is: 


JVl+y? ‘I 8 |/f/l+y? C 
y Oy! y 
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/1 12 U 
y yVlty? 
1 12 12 
Vity y _o 
y yVflt+y? 
1 12 12 
ry y a 
yJity? yJ/l+y? 
1 
ee SS 
yfl+y? 
1 
yr(ity?) = a 
This is the Euler-Lagrange equation which we solve as follows: 
1D Less C?y? 
= C242 
Pea V1 — C2y? 
a dy = xtdz 
V1 — C2y? 
V1 — C2y? 
cna = +74+C, 
Now, from the fixed condition y(0) = 0 we get Cy = —1/C and hence: 
JVI 2 oe 2 
C 7 C 
V1—-C%y2 = FCxr+1 
1—C?y? = C?x?=2Czr+1 
O77? -= C727 = 2Ce 
2 
yy = -a?+ a 
yo = —a2* + be (b = £2/C) 


Regarding the variable boundary, we impose the transversality condition: 


FdX + (dY —y'dX)Fy = 0 
dY 
F+(1-y)Fy = 0 (the boundary curve is Y = X + 2) 
VEE (kay yl! = 0 
y yVlty? 
14+ %/? yl! yl”? 4, 0 
y 1l+y? y 1l+y? y 1l+y? 
1 / 
+y tig 
yfl+y? 
1l+y' = 0 
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(27) 
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Referring to Eq. 27, the extremal curve is given by y? = —a? + ba. On differentiating this implicitly 

we get 2yy’ = —2x +b. Now, on the point of intersection (where the extremal curve and the boundary 

curve meet) we should also have y = x + 2 (since this point is on the boundary curve) as well as 
/ 


y’ = —1 and hence on substituting from y = x + 2 and y’ = —1 into 2yy’ = —2a + b we get: 


2(a+2)(-1) = -2r+6 
—24-4 = -2r+56 
b = —4 


Accordingly, the solution (taking the positive root) is y = ~—x? — 4x (which is plotted in Figure 8). 
The two curves meet at point (/2 — 2, V2). 


Figure 8: Plot of the extremal curve y = /—«x? — 4a (solid) and the boundary line y = x + 2 (dashed). 
See part (c) of Problem 1 of § 1.9. 


2. Solve the following variational problems (in which we have two variable boundaries): 
(a) F(z, y,y') =y? +y' + yy’ + 2y with zg =0 and « =2. 
(b) F(a, y,y') = f/14+ y? with y = 24 and y=2—-1. 
Answer: 
(a) In this case the left point of the extremal curve is free to move on the vertical line « = 0 while 
the right point of the extremal curve is free to move on the vertical line x = 2. The Euler-Lagrange 
equation is: 


12 


I 
o 


0 d 0 
By ly ty +40! +29] a(au ry! +w/ +2u)) 


/ d / 
yt2—— (yi +1+y) = 0 
y +2-—2y"—-y/' 0 
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y’ —1 _ 0 


So, the solution is y = $2? + Cx + D (with C and D being constants). This can be easily verified by 
substitution into the last equation. 

Following similar procedures and arguments to those of part (a) of Problem 1 we get the following 
transversality conditions: 


Fy =0 atx=0 and Fy =0 atx=2 


Accordingly, at = 0 we have: 


6) 
py Wo tal +uy'+2y) = 0 
2y’+1+y = 0 
1 
2(r+C+0)+14 (50? +c0 +) = 10 
2(0+C+0)+14+(0+04+D) = 0 (x = 0) 
2C0+D+1 = 0 
Similarly, at « = 2 we have: 
2(2+C+0)+1+(2+2C+D) = 0 (x = 2) 
4C+D+7 = 


Hence, C = —3 and D = 5. So, the solution is y = $u° — 3x +5 (which is plotted in Figure 9). The 
extremal curve meets with the boundary line x = 0 at point (0,5) and with the boundary line x = 2 
at point (2,1). 

(b) In this case one boundary point of the extremal curve is free to move on the curve y = x* while the 
other boundary point of the extremal curve is free to move on the line y = «— 1. Referring to part (i) 
of Problem 9 of § 1.4 (also see footnote [12] ), the Euler-Lagrange equation for this Problem is y’ = a 
and hence y = ax + b (with a and b being constants). Following similar procedures and arguments to 
those of the previous Problems we get the following transversality conditions: 


‘a 
Ft (= - ‘ By = 0 (for the boundary Y = «*) 
/ 
14 y’? (42° y') y = 0 


Jit 


1+y?+ (40° -y)y = 


1+ 423y/ = 
1+4c°a = 0 (28) 
AND 
dY ; 
Ft aye Fy - 0 (for the boundary Y = x — 1) 


Ji+y?+(1-y') = = 0 


l+y?+(1-y')y = 0 
1+y' 
l+a = 


a 


I 


I 
| 
ii 


(29) 
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Figure 9: Plot of the extremal curve y = $x? — 3x +5 (solid) and the boundary lines x = 0 and a = 2 


(dashed). See part (a) of Problem 2 of § 1.9. 


Therefore, the equation of the extremal curve becomes y = —x + b. So, what is left is to find 6. On 
substituting from Eq. 29 into Eq. 28 we get 1 — 42° = 0 and hence x = 47'/8, Now, the point with 
coordinate « = 4~!/3 is where the extremal curve y = —x + b and the boundary curve y = x* meet 


and hence we should have: 


-—r+b = x 
4-1/3 +b 4-4/3 
b 4-4/8 4 4-8 
b ~ 0.787451 


Accordingly, the solution is y = —x + 0.787451 (which is plotted in Figure 10). The extremal curve 
meets with the boundary curve y = x at point (4~1/3, 4-4/9) and with the boundary line y = x — 1 
at point (0.893725, —0.106275). 

3. Find the solution of the variational problem I [y] = = V1+y’? dx when we have: 
(a) Fixed boundary y(0) = 0 and variable boundary x = 1. 
(b) Fixed boundary y(1) = 1 and variable boundary x = 0. 
(c) Variable boundaries x = 0 and x = 1. 
Answer: Referring to part (i) of Problem 9 of § 1.4 (also see footnote [12] ), the Euler-Lagrange 
equation for this Problem is y’ = a and hence y = ax + b (with a and b being constants). Following 
similar procedures and arguments to those of the previous Problems we get the following transversality 


condition: ; 


Fy = _—___=0 
/L+y” 


So, the solution in all three cases (i.e. a, b and c) is y = b (ie. a horizontal line). Accordingly, all we 


and hence y =0 
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Figure 10: Plot of the extremal curve y = —x + 0.787451 (solid) and the boundary curves y = «* and 
y = «x — 1 (dashed). See part (b) of Problem 2 of § 1.9. 


need is to determine the value of b for each one of these cases, that is: 
(a) From the fixed boundary condition y(0) = 0 we get b = 0 and hence the solution is y = 0. 
(b) From the fixed boundary condition y(1) = 1 we get b= 1 and hence the solution is y = 1. 
(c) There is no restriction on b (apart from being a constant) and hence the solution is y = 6 (i.e. any 
horizontal line can be a solution). 

4. Use the transversality condition in the following variational problems with variable boundaries to 
determine y' at the boundaries:|37] 
(a) F=y?+y+C with boundary curves x = 1 and x = 2. 
(b) F=y” + zy’ with boundary curves x = a and x = 8 (where a and 8 are constants). 
(c) F = y? — 2yy’ with boundary curves x = a and x = b (where a and 6 are constants). 
Answer: The boundary curves in all these cases are vertical lines. So, referring to the previous 
Problems (see for instance part a of Problem 1) the transversality condition in all these cases should 
be given by Fy = 0. Accordingly: 
(a) In this case Fy, = 2y’ = 0 and hence we should have y’ = 0 at both boundaries, i.e. the slope of 
the extremal curve should vanish at the boundaries x = 1 and x = 2. So, the extremal curve at the 
boundary lines satisfies the following conditions: y’(1) = y'(2) = 0. 
(b) In this case Fy = 2y' + 2 = 0 and hence at the boundaries we should have y’ = —2/2. So, the 
extremal curve at the boundary lines satisfies the following conditions: y/(a) = —a/2 and y/(8) = 
—B/2. 
(c) In this case Fy, = 2y’ — 2y = 0 and hence at the boundaries we should have y’ = y. So, the 
extremal curve at the boundary lines satisfies the following conditions: y’(a) = y(a) and y’(b) = y(0). 


[37] This sort of problems is usually studied within the context of investigating natural boundary conditions in the calculus 
of variations. However, because we have no appetite for going through these details we posed this Problem in this rather 
primitive way. 
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1.10 Variational Problems of Mixed Nature 


In some variational problems we have more than one of the complications investigated in the previous 
sections (ie. § 1.5 to § 1.9). For instance, we may have more than one independent variable and more 
than one dependent variable or we may have more than one dependent variable with some constraints. In 
such cases the aforementioned treatments apply simultaneously. For example, if the variational problem 


has m dependent variables y1,--- , Ym and n constraints [Gj,---, {G, then the variational formulation 
: <. [38] 
is: 


v2 
Tly,::: smn] = f FEE igs Yrs Yas ea) dx 
Ly 
OH d (OH 
psy (essay ee (| aoe 
oar ener ae 


Pe 


i=l 


where H is given by: 


Problems 


1. State the variational formulation for a variational problem with two independent variables (a@ and ) 
and two dependent variables (y and z). Generalize this formulation to variational problems with m 
independent variables and n dependent variables. 

Answer: Combining the formulations of § 1.6 and § 1.7, the functional integral is: 


Tyal= ff F (a, 8, y, 2, Yas Zas Ys, 2p) da dB 
Q 
where yq = Oy/Oa (and similarly for za, yg, zs). Hence, we have two Euler-Lagrange equations: 
OF O (OF O (OF aap 
Oy Oa \ Oya AB \ dys) 
OF O (OF O (OF 
Oz 0a \ Oz OB \ dz 
To generalize this formulation, let denote the independent variables with x; (i = 1,--- ,m) and the 


dependent variables with y; (j = 1,--- ,n). On combining the formulations of § 1.6 and § 1.7 (using 
this notation), the functional integral is: 


| 
ro) 


T[yi,-°: Yn = ff F(ay,--: 9Umsy Y15°°* 5 Yn, Y11.°°* 5 Yims Unis ,Ynm) dx, -++d&m, 
Q 


where Ynm = OYn/OXm, (and similarly for similar notations). Hence, we have n Euler-Lagrange equa- 


tions: 
OF 90 (OF 
— 0 ;= L eee ,n 


where yj; = Oy;/O2j. 

2. State the variational formulation for a variational problem with two independent variables (t and x) 
and one dependent variable y where the functional J depends also on the second order derivatives of 
the dependent variable y (i.e. 0?y/0t?, 02y/dxdt and 07y/dx). 

Answer: Combining the formulations of § 1.5 and § 1.6, the functional integral is: 


I ly) =. | F (t,.2,Y, Yes Yas Ytts Yer Yor) dt dx 
Q 


[38] In fact, this formulation has some restrictions which are not given here (because the example is for demonstration only). 
For instance, in some problems the multiplier \ (or multipliers \’s) may enter as a dependent variable. 
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while the Euler-Lagrange equation is:!°! 


OF 0O/(0F Of OF \. « Of OF oo OLY -. OF (ORS. 0 

Oy Ot (=) Ox (=) ' Ot (=) ' Oxdt ($=) ' Ox? (=) > 
where y, = Oy/Ot, Ye = Oy/Ox, yur = O7y/Ot?, Yo = O?y/OxOt and Yer = O7y/Ox?. 

3. State the variational formulation for a variational problem with one independent variable x and two 

dependent variables (y, and y2) where the functional J depends also on the second order derivative of 
y, and the second and third order derivatives of y2. Generalize this formulation to variational problems 
with m dependent variables where the i‘” dependent variable depends on the derivatives up to (and 
including) n‘” derivatives. 
Answer: Combining the formulations of § 1.5 and § 1.7, the functional integral is: 


UL 


x2 
I lyi, ye] =| F (2, yt, Yas Yr V1» Yoo Yas Vo ) dx 
Xy 


Hence, we have two Euler-Lagrange equations: 


OF d (OF “ d? (OF 2 
Oy, dx \ Oy; dx? \ Oy! - 


OF d (OF | d? (OF d®? ( OF ae, 
dy. dx \ dys) ° dx? \ dy dx? \ dys! = 
To generalize this formulation, let denote the dependent variables with y; (¢ = 1,---,m) and the 


highest derivative of y; with ys), 


notation), the functional integral is: 


On combining the formulations of § 1.5 and § 1.7 (using this 


x2 
Lge Ym] =} F(2.nwie yf? jo Umistwas** en) dx 
xy 


Hence, we have m Euler-Lagrange equations: 
OF ; @ f{ OF , 


Note: the highest order derivative for the dependent variables y;’s may differ (as the notation n; 
indicates) and hence the highest order derivative of y; which I depends on may be the third while the 
highest order derivative of y2 which J depends on may be the second. We should also note that some 
of the derivatives between the first order and the highest order may be missing!*° (e.g. J may depend 
on the first and third order derivatives of y,; but not on the second order derivative) and hence the 
above formulation could be amended accordingly. 

4. Extend the generalized formulation of Problem 3 for the case in which the variational problem also 
includes k constraints [ Gi,--- , [ Gr. 
Answer: We have: 


x2 
I[yi,-°: Ym =| H(a.msui Go yas, SVaieUeae?? syn dx 
a 


1 
where: 


k 
H=F+S XG 


i=l 


[39] The reader should be careful in interpreting the partial derivatives with respect to the independent variables (as explained 
in § 1.6). 

[40] Tn fact, in some cases even the first order derivative (and possibly the derivatives of all orders) of some dependent variables 
can be missing. 


1.11 Summary 


Hence, we have m Euler-Lagrange equations:!+!! 


OH ; di { OH 
OY; fal dx) ay? 
1.11 Summary 
We can summarize the results of the previous sections in Table 1. 
Table 1: Summary of the results of sections 1.4-1.10. 
Variational Problem Euler-Lagrange Equation 
F(a,y,y’) aa gr) =0 or 
or a F yl! gr) = 0 or 
OF ar 1 OF NOF _ 
dy dady’ — Y Bydy’ dy” 
F(y,y') F—-/3% =C 
F (z,y') ar =C 
F (2,4) = 0 
n oF n i d' ( oF 
F (a,y,y™,-+- sy! yD) ag Doi (=1) dat (25) =0 
F(a,-°: tn Ys Yaris" Ue) on ee a (22) =0 
ECB e eS he ih k SHG) gr _ 4 (3h) =0 (ij =1,- ,n) 


F (a,y,y’) with n constraints 


[Gi fGn 


where H = F497, XG; 


F (a,y,y’) with variable boundary(s) 


Oy dx 


of 28 (35) =0 with transversality condition(s) 


Mixed nature 


Mixed formulation (as above) 


[411 In some cases, it may be required to treat ’s as variational dependent variables and hence other equation(s) may be 
added (as required). In fact, extra conditions and restrictions are required to make the formulation more rigorous (so the 
above formulation is mainly for the purpose of demonstrating the idea of mixed techniques in treating the variational 


problems of mixed nature). 


Chapter 2 
Optimal Curves 


In this chapter we present and solve problems about topics and applications of the mathematics of variation 
related to optimal curves, i.e. we are looking in these problems to certain curves (or 1D objects) that 
optimize something (such as length). In fact, the given problems represent just a sample of the variational 
problems in this category to outline the methods of tackling this sort of problems. There are many other 
problems of this type that can be considered and solved similarly. This equally applies to the problems 
of other forthcoming chapters. 


2.1 Geodesic Curves 


The objective in these problems is to find a curve of optimal length (usually shortest) that connects 
two boundaries (whether fixed points or variable curves) in a given multi-dimensional space (such as 2D 
surface or 3D Euclidean space). 
Problems 
1. Find the equation of geodesic (i.e. curve of shortest length connecting two points) on a Euclidean 
plane. 
Answer: The length s is the integral of the line element ds along the path T that connects the two 
points, that is s = fs ds. In a plane coordinated by an orthonormal Cartesian coordinate system, the 


line element ds is given by ds = \/(dx)” + (dy)* (see Figure 11) and hence the length (which represents 
the functional J that we are supposed to optimize) is given by: 


v= f enteant =f" fis (2) a= [" vine tt 


where the prime means d/dz.'47! On comparing this to Eq. 1 we can see that F (2, y,y’) = /1+y?. 
If we now apply the Euler-Lagrange equation (i.e. Eq. 2) we get:!4%) 


aJi+y?_ d (5) ara 
Oy da Oy’ 
= 2y/! ze 
dz \ 2,/1 + y 
(yin) > 
a \ Jiae 
y = D (D is constant) 


Vit? 


[42] We note that the line element ds (in Cartesian coordinates) may be cast in another form, that is: 


ds = / (dx)? + (dy)? ) t1ldy=V1+2 dy 


where the prime here means d/dy. This form may be used in solving some problems and may have certain advantages 
(such as simplifying the solution) over the common form in some cases. 
[43] Because F depends on y’ only, we can also use Eq. 4 or footnote [12]. However, we use Eq. 2 for more practice. 
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O 


Figure 11: A simple sketch depicting the setting of the Problem of shortest distance connecting two points 
(A and B) on a plane with ds representing infinitesimal arc length (or line element). See Problem 1 of § 
2.1. 


y? = D®(1+y") 

y2(1-D?) = D? 
— es (a = +,/D?/(1— D?) is constant) 
y = axt+b (b is constant) 


which is an equation of a straight line. So, the shortest distance between two points on a plane is 
the length of the straight line segment that connects these two points, (i.e. on a Euclidean plane the 
geodesic is a straight line). 
Note: it is obvious that the optimal length in this Problem is a minimum (not a maximum) because 
the length of a curve connecting two points can diverge. 

2. Re-solve Problem 1 using this time plane polar coordinates (p, ¢). 


Answer: In polar coordinates ds = / (dp) + p? (d@)” and hence: 


s= [Via + 02 (a0) = [ y'1+ p28 (do/dp) dp = [ Jit pF dp = 118} 


PB 
PA 


where the prime means d/dp. On comparing this to Eq. 1 (noting that x,y, y’ in Eq. 1 correspond to 


p,, @’ here) we can see that F'(p,¢,¢') = 1+ p?¢”. If we now apply the Euler-Lagrange equation 
(i.e. Eq. 4 noting that F' is independent of ¢ which corresponds to y) we get: 


OVIF Pe _ 


Og! 
p? g oar, 
pig”? = C? (1 on p’¢'”) 
¢? a C? 


p4 = C? p? 
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68 


C 


g’ 


e+ 0 


l| 


[2 (32 
+ arctan (=) 


C 


VF=@ 


t+tan(¢@+¢0) = G 
C tan? (¢+¢0) = p?-C? 
oe +40) +1] = # 
*sec?(d+¢0) = p* 
pcos" (P+ $0) = C 
pcos(¢+¢o) = +C 
This is an equation of a straight line as can be shown as follows: 
pcos(¢+¢9) = +C 
p(cos bcos dg — sindsingdg) = +C (trigonometric identity) 
pcos @cos do — psingsings = +C 
xcos¢dg—ysindog = +C (transforming to Cartesian) 


As we see, this is an equation of a straight line (noting that ¢9 and C are constants). 

Note: when we take the square root in Problems like this we usually use + for the sake of formality 
although C (and its alike in similar Problems) is usually arbitrary and hence it can represent both 
signs. 

. Find the equation of geodesic in 3D Euclidean space. 

Answer: This Problem is similar to Problem 1. We use Cartesian coordinates x,y,z and hence 
ds = \/(dx)? + (dy)? + (dz)? = \/1+ y2 + 22 dx (where the prime means d/dx). Hence, the distance 
(which represents the functional IJ that we are supposed to optimize) is given by: 


2B 
a= as= | 
T “LA 


So, F(a, y,2,y',2) = Jl +y? 4+ 2? (noting that this is a Problem with multiple dependent variables; 
see § 1.7). Accordingly, the Euler-Lagrange equations for the dependent variables y and z are given 
by Eqs. 19 and 20 (noting that y1, y2 there correspond to y, z here). 

The y equation is: 


l+y?+2?%dr =I ly, z] 


lty?+2? d O l+y?+2? _ 0 
Oy dx Oy! 7 
/ 
(eee a = 0 
dx /L+y'2 +4 2)? 

y! 

oe 8, Ge 
/l+ y+ 2)? 
y’? — 2 (1+ y? +2) 
(1 -C?) O? (1+ 2”) (30) 


The z equation is: 


Ce eee ee 
dx 


O l+ty?+2? 
Oz! ae 
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d / 
0 z 0.16 
dx /1 + y'? + 2! 
2! 
CC CUD 
/L + y'2 + 2! 
72 — 7? (1 ae yl’? + 2!) 
z?(1-D*) = D?(1+y7) (31) 


On substituting from Eq. 31 into Eq. 30 (see the upcoming note 1) and simplifying we get y’ = constant 
and hence y = ax + b (with a and b being constants) which is an equation of a plane. Similarly, on 
substituting from Eq. 30 into Eq. 31 (see the upcoming note 1) and simplifying we get z’ = constant 
and hence z = cx + d (with c and d being constants) which is also an equation of a plane. So, the 
solution of the system of the y and z equations is the intersection of these planes which is a straight 
line. Accordingly, the geodesic (or the curve of shortest length) in this case is a straight line. 
Note 1: from Eq. 31 we get: 

7/2 = D* (1 BE y”) 

1— D? 

On substituting from this equation into Eq. 30 we get: 


i Ce Oe Me Oi Ga) 


1 — DP 
rine = oe 2, Oe 
Re) Or 
= (014 2) (2 BY" 
y’ = constant 


We similarly obtain z’ = constant (with the exchange of y’ and 2’ and C and D). 
Note 2: it is obvious that the optimal length in this Problem is a minimum (not a maximum) because 
the length of a curve connecting two points can diverge. 

4. Find the equation of geodesic on a right circular cylinder. 
Answer: We use cylindrical coordinates p, ¢, z to represent the cylinder where the axis of the cylinder 
coincides with the z axis of the coordinate system. Now, the line element ds in cylindrical coordinates 
is given by: 


ds = J (dp)? + p? (dd)? + (dz)? 


For right circular cylinder p is constant and hence dp = 0 and p = R (with R being the constant radius 
of the cylinder). Therefore, the line element of the cylinder is: 


ds = (rR (db)” + (dz)? = /R + (dz/dd)’ dd = VR? + 2? dd 


where z’ = dz/d¢. Hence, the length of the curve I’ (which is the functional IJ that we intend to 
optimize) is: 


s= fo as= [f° Ve Rag = 1K 


where $1, ¢2 represent the azimuthal coordinates of the end points of the curve. On comparing this 
to Eq. 1 (noting that z,y,y’ in Eq. 1 correspond to ¢,z,z’ here) we can see that F = VR? 4 2’. 
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Considering that F' has no explicit dependency on ¢ (which corresponds to x in our case), we can use 
the Beltrami identity (i.e. Eq. 3 with the replacement of y/ with 2’), that is:!4] 


D4. ofa 
Rey a yOve +z _o 


RE = C?(R? +27) 
a FR 
z = +tD¢d+E (E is constant) (32) 


which is an equation of a circular helix. 

Note: the helix can represent a circular arc as a special case when D = 0 (in the case where the two 
points connected by the geodesic are on a circle). It can also represent a generator line parallel to the 
z axis when D -> oo (in the case where the two points connected by the geodesic are on a generator). 

5. Re-solve Problem 4 using this time a multiple dependent variables approach (see § 1.7). 

Answer: We use t-parameterized cylindrical coordinates p(t), ¢(t), z(t) to represent the cylinder where 
the axis of the cylinder coincides with the z axis of the coordinate system. Now, the line element ds 
in cylindrical coordinates is given by: 


ds = ¥/ (dp)” + p? (dd)? + (dz)? 


For right circular cylinder p is constant and hence dp = 0 and p = R (with R being the constant radius 
of the cylinder). Therefore, the line element of the cylinder is: 


ds = \/ R? (dd)? + (dz)? = v/ R2 (do/dt)? + (dz/dt)? dt = \/ R2¢2 + 22 at 


where the overdot represents d/dt. Hence, the length of the curve [ (which is the functional J that we 


intend to optimize) is: 
te ; 
= as= | \/ R26? + 22 dt =I(¢, z] 
r ty 


where t1,t2 represent the values of the parameter ¢ at the end points of the curve. On comparing this 
equation to Eq. 18 (noting that 2, y1, ye, y,,y4 in Eq. 18 correspond to t,¢,2z,,Z here) we can see 


that F = 4/ R242 + 52. Accordingly, the Euler-Lagrange equations for the dependent variables ¢ and 


z are given by Eqs. 19 and 20 (with the replacement of x, y1, y2, y}, yo with t, d, z, d, Z). 
The ¢ equation is: 


a R2¢2+2 gq (d/R?24+2 


Og dt Op 


[44] Noting that only 2’ appears in F' we can use footnote [12] (or Eq. 4) to conclude z’ = constant immediately. However, 
we use here the Beltrami identity for diversity and practice (as well as showing that these equations lead to the same 
result). 
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which is an equation of a circular helix. 
Similarly, the z equation is: 


Oy/R2¢2 +22 gq [ O\/ R?d? + 2 = 3 
Oz dt Oz 
esl 2% 2 
dt \ o,/R2g24 2 
Zz eee 


[R2¢2 4 32 
Bo = CER’ + CR 


or (2a) 


1-C?3 
. C3 R? 
oS ED Dz =4/—? 
z 2p ( 2 Joe =) 
z = +tDe¢+ E> (FE is constant) 


which is also an equation of a circular helix. 

So, the solution from both equations is the same (which is also the same as the solution obtained in 
Problem 4; see Eq. 32). As before (see the note of Problem 4), the helix can also represent a circular 
arc (i.e. an arc of a circle r = R) as a special case. It can also represent a generator (i.e. line parallel 
to the z axis) as another special case. 

Note 1: because a circular helix (on a right circular cylinder) passing through two given points is 
unique (within certain conditions; see note 2), all the above equations (i.e. z = +Dd¢+ E of Problem 
4and z=+D,¢+ FE, and z = +D2¢+ E> of the present Problem) represent the same helix. 

Note 2: we are assuming a given handedness (as well as being optimal in a specific sense). Anyway, 
being unique or non-unique is related to optimizing in a global sense or stationarizing (noting that in 
this regard there are some details that can be worked out rather easily). In brief, we can say more 
explicitly: although there are generally infinitely many circular helices that can pass through two given 
points on a cylinder, only one of these is optimal in the specific (global) geodesic sense with a given 
handedness. In fact, the uniqueness of geodesic (in a global sense) should be achieved if we restrict 
the magnitude of the change in ¢ for the geodesic to the range 0 < |Ad| < 7 (where Ad = ¢2 — oy 
with ¢; and ¢2 being the azimuthal coordinates of the end points) with a conventional selection of a 
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specific handedness for the case |Ad| = 7. We should finally note that part of the confusion about 
these issues may originate from the meaning of ¢ and if it belongs to the coordinate system and hence 
to the cylinder (where 0 < ¢ < 27) or it belongs to the helix that connects the two end points (and 
hence ¢ represents the “spin” of the helix regardless of the aforementioned restriction on ¢). In other 
words, whether the helix (as a function of ¢) is represented as one-to-many or as one-to-one. Also see 
Problem 6. 

6. A right circular cylinder has a radius R = 6 with its axis being aligned along the z axis of a cylindrical 
coordinate system. Find the equation of the geodesic on this cylinder that passes through the point 
(oa, 24) = (8, 1) and the point (dp, zg) = (7,6). 

Answer: Inserting the coordinates of these points into Eq. 32 (noting that the sign is rather arbitrary), 
we get: 


T 

1 = D-+E# 
ea 

6 = Dr+ek 


On solving these equations we get D = 10 and £ = —4. Accordingly, the geodesic on this cylinder 
that passes through those points is a helix (or rather helical arc) described by the equations: 


1 
er ee and p=6 
T 


Note: if we used the minus sign in Eq. 32 we get D = —10 and & = —4 and hence z = 4 — 4 which 
is the same. 

7. Find the equation of geodesic on a sphere. 
Answer: We use a spherical coordinate system r,@,¢ centered on the center of the sphere. Now, the 
line element ds in spherical coordinates is given by: 


ds = V (dr)? +r? (d0)” +r? sin? 6 (dd)? 


For sphere, r is constant and hence dr = 0 and r = R (with R being the constant radius of the sphere). 
Therefore, the line element on a sphere is: 


ds = \/ R? (d0)” + R2 sin? 0 (dd)? = Ry (d0)? + sin? 6 (dé)? = Ry/1 + 6? sin? 0 dd 


where ¢’ = d¢é/d0. Hence, the length of the curve [' (which is the functional J that we intend to 


optimize) is: 
02 
s= | as=R 14+ ¢? sin? 6 dd = I [d] 
r 61 


where 01,62 represent the @ coordinates of the end points of the curve. On comparing this to Eq. 1 
(noting that x,y,y' in Eq. 1 correspond to 6,¢,¢' here) we can see that F = \V/1+ ¢” sin? 6 (noting 
that R is a constant and hence it is no more than a scaling factor). Considering that F’ has no explicit 
dependency on ¢ (which corresponds to y in our case), we can use Eq. 4 (with the replacement of y’ 
with ¢’), that is: 


4 (\/1 +2 sin?) =a 6, 


2¢' sin” 0 aa 
2/14 d? sin? 6 
d?sint*6 = C?+4+C7¢" sin? 6 
¢” = 


sin’ @ — C2 sin? 6 
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¢? C? 
sin’ @ (1 — C? sin”? 6) 
jen C? esc* 9 
~ (1— C2 esc? 6) 
C csc? 6 
a v1— a cot? 6 ee eG) 
2 
fo csc“ 6 
ye (25$°) — cot? @ 
C2 
oe csc? 6 dd 
(455) — cot? 0 


Now, let D? = O?/(1 — C?) and w = cot 6 (and hence dw = — csc? 6 d0). On substituting these in the 
last equation and integrating we get: 


(aka acer 


@ = —arcsin(Dw) + do (do is constant) 
go -b = arcsin(D cot A) 
sin(¢o -—¢) = Decoté 
sin dp cos@—cosdgsing = — (trigonometric identities) 
singoRsinécos¢—cos¢doRsiné@sing = DRcosd (multiplying with R sin @) 
(cindy) —(cosdo)y = Dz 


where in the last line we transformed to Cartesian coordinates. As we see, the last equation is an 
equation of a plane passing through the origin (which is the center of the sphere) and hence the 
geodesic curve on a sphere is an arc of a great circle of the sphere (where the great circle is the 
intersection of the plane and the sphere). 

. Find the equation of geodesic on a right circular cone. 

Answer: We use spherical coordinates r,@,@ to represent the cone where the apex of the cone is at 
the origin of coordinates while the axis of the cone coincides with the @ = 0 axis of the coordinate 
system. Now, the line element ds in spherical coordinates is given by: 


ds = V (dr)? +r? (d0)” +r? sin? 6 (dd)? 


For right circular cone (according to the above setting), 6 is constant and hence dO = 0 and 0 =a 
(with a being a constant angle). Therefore, the line element on a cone is: 


ds = Var? + r2 sin? Qa (do)? = V(ar/ao)? 4 r2 sin? Q dd _— 4) Wey ee a dd 
where r’ = dr/dd. Hence, the length of the curve T (which is the functional I that we intend to 
optimize) is: 
p2 
s= | a= f r/2 4 72 sin? ado = I [r] 
r AE 


where ¢1, ¢2 represent the azimuthal coordinates of the end points of the curve. On comparing this to 
Eq. 1 (noting that x, y,y’ in Eq. 1 correspond to ¢,r,r’ here) we can see that F = V/r’2 +r? sin? a. 
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Considering that F' has no explicit dependency on ¢ (which corresponds to x in our case), we can use 
the Beltrami identity (ie. Eq. 3 noting that y’ in Eq. 3 correspond to r’ here), that is: 


3) 
Vri2 + r2 sin? a — r' — (vir? +r? sin? a) = C 


Or! 


Or! 
Vr? + r2 sin? a r( e = C 
2 


r!2 + 72 sin? a 


yl? 
Vr? + r2 sin? a 5 =" 30; 
Vr? +r? sin* a 


re tr sintza—r? = Cri? +r2sin? a 
rsinta = CV r24+r?sin2a 


résinta = C? (r? +r? sin? a) 
YQ r4sin* a 2.2 
re = ——_p¢*sin*a 
C2 
r? = ar* = br? (a and 6 are constants) 

r= ar’ — br? 
dr 
a ar4 — br? 
dp 
dr 
dé = —— 
ar’ — br? 


1 ar? — b 
og = Go (\ aa +D (33) 


where the result of the last line can be easily checked by differentiation (to obtain the results of the 
earlier lines). 

9. Re-solve Problem 8 using this time cylindrical coordinates (p, ¢, z). 
Answer: Let the axis of the cone coincide with the (positive) z axis of the coordinate system while the 
apex of the cone be at the origin of coordinates. Now, the line element ds in cylindrical coordinates is 
given by: 


ds = J (dp)? + p (dd)? + (dz)? 
For right circular cone z is proportional to g and hence z = kp where k is a constant. Therefore, the 
line element of the cone becomes: 


ds = y/ (do)? +p? (dd)" + (dp)? = VT ++ PgR dp 


where ¢’ = d¢é/dp. Hence, the length of the curve [ (which is the functional J that we intend to 


optimize) is: 
p2 
s=f a= f V1l+k? + p?¢? dp = I[¢] 
Er: Pl 


where (1,2 represent the p coordinates of the end points of the curve. On comparing this to Eq. 


1 (noting that z,y,y’ in Eq. 1 correspond to p,¢,¢’ here) we can see that F = \/1+k? + p?¢”. 
Considering that F' has no explicit dependency on ¢ (which corresponds to y in our case), we can use 


Eq. 4 (with the replacement of y’ with ¢’), that is: 


ar 
Og! 


= C 
pd 


/1 + k2 + p?’? 


= ¢ 
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10. 


11. 


po”? = C? te C7 k? +4 Cre 
poss C? + C?k? 
p* = C2 p? 


dp — [C2 4+.C2k? 
dp — pt — C2p? 


2 2 
V1 + k2 arctan (4) 1D) (34) 


o 
| 


Note: the similarity in form between Eq. 33 and Eq. 34 is because on a right circular cone (coordinated 
as described in Problem 8 and in the present Problem) p of the cylindrical coordinates is proportional 
to r of the spherical coordinates (since p = rsina) noting that ¢ is the same in both coordinate 
systems (assuming that the systems are in a standard configuration with respect to a corresponding 
orthonormal Cartesian system as we do). 

Find the equation of geodesic on a surface of revolution. 

Answer: Let use cylindrical coordinates p,,z where the z axis of the coordinate system is the axis 
of the surface of revolution and hence the profile curve (i.e. meridian) of the surface of revolution is 
given as p = f(z). Now, the line element ds in cylindrical coordinates is given by: 


ds = y/ (dp)? + p? (dd)? + (dz)? = Vp? + PO? + dz = VF? + PPO? + Lda 


where the prime means d/dz. Hence, the length of the curve I (which is the functional J that we 
intend to optimize) is: 


s= | as= ” [fra Pes dz = 1d] 
T Zy 


where 21,22 represent the z coordinates of the end points of the curve. On comparing this to Eq. 


1 (noting that x,y,y’ in Eq. 1 correspond to z,¢,¢’ here) we can see that F = ,/f/? + f?¢? +1. 
Considering that F' has no explicit dependency on ¢ (which corresponds to y in our case), we can use 


Eq. 4 (with the replacement of y’ with ¢’), that is: 
0 /fi2 + f2o’ +1 3,56 
Og! 
fg 
a = C? ( a f°¢” a 1) 


= Cc (35) 


ie C? f+ C? 
0 FOF 
12 
Oe 40) ata (36) 
12 1 
b= ; ata pit (37) 


which is the equation of geodesic on a surface of revolution (in cylindrical coordinates). 

Apply the result of Problem 10 on the right circular cylinder and hence confirm the result of Problem 
4. 

Answer: Using the setting of Problem 4 we have p = f(z) = R where R is the (constant) radius of 
the cylinder. So, from Eq. 37 (with f = R and f’ = 0) we have: 


07 +1 
+9 = Cf Vom 
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12. 


13. 


14. 


15. 


1 
eo OV pa cage tO 


z = +DO+E 


where the constants D and EF are defined accordingly. The last equation is the same as Eq. 32 of 
Problem 4. 

Apply the result of Problem 10 on the right circular cone and hence confirm the result of Problem 9. 
Answer: Using the setting of Problem 9 we have p = f(z) = «z where & = 1/k. So, from Eq. 37 
(with f = «z and f’ = &) we get: 


Kk? 41 
cf FETT a FEIT dz 


re f iy 
i \/ 24 — (C/K)? 2? 


CV +1 2 — (Ofn)” 


eo 


= REGIA) arctan oe + D 
ae CEE ima 2 


= V1+4(1/«?) arctan (“=| + D 
= vV1+k? arctan (2*) +D 


C 


The last equation is the same as Eq. 34 of Problem 9. 

Show that meridians of a surface of revolution are geodesic curves. 

Answer: Meridians are represented (in the cylindrical coordinates of Problem 10) by the condition 
¢’ = 0 and hence from Eq. 36 or Eq. 35 (with C = 0) we get ¢ = constant (which is the geodesic 
equation of meridians). So, meridians of a surface of revolution are geodesic curves. In other words, 
Eq. 37 (which is the equation of geodesic on a surface of revolution) represents meridians when C = 0 
(which leads to 6 = constant of integration) and hence meridians are geodesics. 

Confirm that (arcs of) great circles of spheres are geodesic curves. 

Answer: This is a consequence of the result of Problem 13 because (arcs of) great circles of spheres 
are meridians (noting the spherical symmetry of spheres). This is inline with the result of Problem 
7 (noting that the result of the present Problem is weaker than the result of Problem 7 because in 
Problem 7 we effectively have “curves on spheres are geodesics iff they are great circles” while in the 
present Problem we effectively have “curves on spheres are geodesics if they are great circles”). 

Find the shortest distance between the point (0,2) and the cubic curve y = 2°. 

Answer: This is obviously a geodesic problem (since it is about shortest distance) on a Euclidean 
plane with one fixed boundary and one variable boundary. So, the solution is obviously a straight line 
y = ax +b whose length between the point (0,2) and the point B (which is on the curve y = 2°) is the 
shortest distance (see Problem 1). Moreover, from § 1.9 we should have a transversality condition as 
well (see Eq. 23) at point B, that is:!45] 


F+(Y'-y')Fy = 0 (at point B) 


[45] In brief, in this Problem (and its alike) the straight line solution is obtained from the result of Problem 1 (because any 


geodesic on a Euclidean plane should be a straight line) and hence the job of the transversality condition is to identify 


the exact point (i.e. point B) at which the straight line and the boundary curve (i.e. the curve y = «° in our case) meet. 
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16. 


Ll+y24(Y’ / y 


V1+a? + (3a? — a) = = 0 (y=ar+band Y = 2?) 
V1+a? 


1+a?+3az?-a? = 0 (xV1+4+ a?) 
1+3az3, = 0 (38) 


= 0 (F = V/1+y’; see Problem 1) 


where in the last equation we used xg to indicate that this equation applies to point B. Now, the 
geodesic y = az + b should pass through point (0,2) and hence 2 = 0+, ie. 6 = 2. Also, point B is 
on both the geodesic y = az + 2 and the boundary curve y = «° and hence: 
3 ap — 2 
atp+2=2p > a= (ap #0) (39) 
TB 

On substituting from the last equation into Eq. 38 and simplifying we get 32% —6zg +1=0. On 
solving this quartic equation we get: 


tp ~  0.16705608755 and hence a ~ —11.94411932 and yp ~ 0.00466216 
OR 
tp ~&  1.19858497157 and hence a ~ —0.23202837 and yp ~ 1.72189428 


Now, if we compute (using Pythagoras) the distance d, between point (0,2) and point (tg, yp) = 
(0.16705, 0.00466) and the distance dz between point (0,2) and point (rg, yg) = (1.19858, 1.72189) 
we get d, ~ 2.00231887 and dz ~ 1.23042624. So, the geodesic (i.e. curve of shortest length) is 
y = —0.232028372 + 2 and the shortest distance is dy ~ 1.23042624. For clarity, we plot the result in 
Figure 12. 

Note: in Eq. 39 we excluded xg = 0 so we need to test the possibility that the geodesic is the line 
segment connecting point (0,2) to point (0,0) on the curve y = x?. However, this line segment cannot 
be the required geodesic because its length is 2 which is longer than d2. 

Find the shortest distance between the parabola y = x? and the straight line y = x — 5. 

Answer: This is obviously a geodesic problem (since it is about shortest distance) on a Euclidean 
plane with two variable boundaries. So, the solution is obviously a straight line y = ax + b whose 
length between the two end points (i.e. By on y = x? and By on y = x—5) is the shortest distance (see 
Problem 1). Moreover, from § 1.9 we should also have two transversality conditions (see Eqs. 24 and 
25 noting that in the following we do not use the subscripted version of these equations for simplicity) 
at the two end points, that is: 


F+(Y'-y)Fy = 0 (at point B,) 
/ 
Vy Sa) Z = 0 (F = V/1+y?; see Problem 1) 
/1 + y/? 
V1l+a? + (2x-a) = = 0 (y = az +b and Y = 2”) 


l+a?+(2r-a)a = 0 (x V¥1+ a?) 


1+a?+4+2azr — a? = 


1+ 2ax, = 0 (40) 
AND 
F+(Y'-y)Fy = 0 (at point B,) 
/ 
Vi+y?+(¥'-y/) u = 0 (F = ./1+y’; see Problem 1) 
/1 + y/? 
je hea tS 0 (y =ar +b and Y =2—5) 
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2 | (0, 2) ae 4 
y=-0.232x+2 ~~ ~4(1.19858, 1.72189) 
v | 
3 
: 0.16705, 0.00466) | 
=| : : : 
—1 0 1 2 3 


X 


Figure 12: The result of Problem 15 of § 2.1. The point (0.16705, 0.00466) is also plotted for clarity. The 
plot seems to suggest that d, is a local maximum. 


(xV/1 +a?) 


1+e7+(1-—a)a 
l+a+a-—a’ — 


a = -l (41) 
So, from Eq. 41 the equation of the geodesic becomes y = —a + b. Also, on substituting from Eq. 41 


into Eq. 40 we get x; = 1/2. 
Now, point B, is on both the geodesic y = —a + b and the boundary curve y = 2? and hence: 


1 1 
7 b=2? > —i+b=— 
Uy+ ry a7 1 


ie. b = 3/4. So, the equation of the geodesic becomes y = —x + 3. 
Similarly, Bz is on both the geodesic y= —x% + 3 and the boundary curve y = « — 5. Hence: 


3 3 
—fa+ 4 = 22-5 —> 2x2 =S+7 


Le. @ = 23/8. So in brief, the geodesic is y = 3 —« and the shortest distance [between point B, with 


5,4) and point Bz with coordinates (3, =)] is (using Pythagoras): 


2 2 Ly 228\" el Nfl 
(er — 22)? + (ys — yo) -/(: =) aCe =) = 1 Gp © 3:35875721 


For clarity, we plot the result in Figure 13. 


coordinates ( 
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Figure 13: The result of Problem 16 of § 2.1. 


17. Find the shortest distance between the parabola y = x? + 2 and the natural logarithm curve y = Inz. 
Answer: This is obviously a geodesic problem (since it is about shortest distance) on a Euclidean 
plane with two variable boundaries. So, the solution is obviously a straight line y = ax + b whose 
length between the two end points (i.e. B, on y = x? + 2 and By on y = Inz) is the shortest distance 
(see Problem 1). Moreover, from § 1.9 we should also have two transversality conditions (see Eqs. 24 
and 25 noting that in the following we do not use the subscripted version of these equations) at the 


two end points, that is: 
F+(Y'-y') Fy 


/T + 12 | y’ / ¥y 


a 
Vi+a? 
1+a?+ (22 —a)a 
1+a? +2ax —a* 
14 2ar, 


1+ a? 4+ (2a —a) 


iy? ea) 


V1+a? ae a 2 


(at point B,) 


(F = \/1+y”; see Problem 1) 
(y=axz+band Y = 2? +2) 
(x V1+ a?) 

(42) 
(at point B,) 
(F = 1+ y”; see Problem 1) 


(y=ax+band Y =Inz) 


(x V1 +a?) 
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18. 


l+a+ar!—a? = 0 
a — —X2 (43) 


On substituting from Eq. 43 into Eq. 42 we get: 


Now, point B, is on both the geodesic y = ax + 6 and the boundary curve y = x? + 2 and hence: 
at; +b=27+2 > b= 2? —az, +2 (45) 
Similarly, Bz is on both the geodesic y = ax + b and the boundary curve y = Ina. Hence: 
atg+b=Iln29 => b=IlInz2 — axe (46) 


On comparing Eq. 45 to Eq. 46 we get: 


zt —axj,+2 = IlInx2- are 
1 HE 
gi —at,+2 = Mm -—a— (substituting from Eq. 44) 
221 221 
1 1 
xy +2o%, +2 = In +29— (substituting from Eq. 43) 
221 221 
1 1 qo" ef 
2 . 
SO at Wg et SE: sing Eq. 44 
att Nose bt oes (using Eq. 44) 
1 1 5 
l q = 0 
"3e, | dg? 1 9 


On solving this equation numerically we get 21 ~ 0.335942278. Hence, x2 ~ 1.488350923 (from Eq. 
44), a ~ —1.488350923 (from Eq. 43), and b ~ 2.612857214 (from Eq. 45 or Eq. 46). 

So, the geodesic is y = —1.4883509232 + 2.612857214 and the shortest distance [between point B,; with 
coordinates (0.335942278, 2.112857214) and point Bz with coordinates (1.488350923, 0.397668744) | is: 


l2 


\/ (0.335942278 = 1.488350923)” + (2.112857214 — 0.397668744)* 
2.066377790 


Via ~ 22)" + (yi — yo)” 


l2 


For clarity, we plot the result in Figure 14. 

Show that the shortest distance between a point A and a (coplanar) straight line (say T';) is on another 
straight line (say I) which is perpendicular to T; (see Figure 4 but assume I, is straight). 

Answer: This is obviously a geodesic problem (since it is about shortest distance) on a Euclidean 
plane with one variable boundary (i.e. [',). So, the solution is obviously a straight line whose length 
between the point A and the point B (which is on T;) is the shortest distance (see Problem 1). Now, if 
T is given by y = ax +b and I is given by y = ax + then we should have a transversality condition 
(refer to § 1.9 and see Eq. 23) at the point of their intersection (i.e. point B), that is: 


F+(Y'-y')Fy = 0 (at point B) 
/ 
ty? 4 3) = 0 (F = /1+y”; see Problem 1) 
J/1lt+y? 
eee a a, an = 0 (y=ar+band Y =azr +p) 


v1+a? 
1+e?+ae-—a? = 0 (xV¥1+a?) 


aa = -l 
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Figure 14: The result of Problem 17 of § 2.1. 


As we see, the last equation means that the product of the slopes of [ and [’; is —1 and hence I and T'; 
are perpendicular.!4°] So, we conclude that I (whose segment between A and B is of shortest length) 
is straight and perpendicular to [';, as required. 


2.2. Fastest Descent Curves 


In this type of variational problems it is required to find the curves along which gravitated objects (i.e. 
objects in a gravitational field) descend in the shortest time. The most famous of these problems (and 
indeed the problem that kick-started the calculus of variations as a new branch of mathematics) is the 
brachistochrone problem.!47! The objective of the problem of brachistochrone in its original and most basic 
form is to find the curve of fastest descent between two fixed points. More technically, the brachistochrone 
curve is a planar curve I in the vertical plane that connects a point A to a point B (with B being below 
A but not vertically beneath it) in a uniform gravitational field so that a bead (representing a massive 
particle) at A takes minimum time to reach B (see Figure 15). The bead is assumed to be released from 
rest at A and it slides frictionlessly along I under the effect of gravity alone. In fact, the brachistochrone 
problem has a number of variations and modifications some of which (as well as the original problem 
itself) will be dealt with in the Problems of this section. 

We should finally note that apart from the property of being the curve of fastest descent (as implied by 
its name) the brachistochrone curve has other interesting properties. For example, it can be shown (see 
Problem 7) that the time taken by a bead to descend to the lowest point (i.e. the point B corresponding 


[46] Tt should be known that two straight lines are perpendicular iff the product of their slopes equals —1. 

[47] As indicated above, the brachistochrone problem marks the beginning of the calculus of variations as an independent 
branch of modern mathematics. This problem was originally considered by Galileo in 1638 but without reaching a correct 
solution. The problem was then presented in a technical form and publicized by Johann Bernoulli in the end of the 17%” 
century (around 1696) where solution was obtained by Bernoulli and other contemporary mathematicians. We should 
finally note that the word “brachistochrone” originates from Greek meaning “shortest time”. 
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Figure 15: A simple sketch representing a wire T' that connects two points (A and B) with a bead D sliding 
frictionlessly along the wire under the effect of gravity alone. The bead is released from rest at point A 
which is at the origin of an inverted Cartesian coordinate system and hence point A has coordinates (0, 0) 
while point B has coordinates (ag, yp). See § 2.2. 


to ¢ = 7 as will be clarified in Problem 1) on the curve is independent of the position of the starting point 
(i.e. the point of release A) on the curve,|48! and hence the brachistochrone curve is also a tautochrone 
(i.e. same time in Greek) or isochrone (i.e. equal time in Greek). We should also note that although the 
fastest descent (or brachistochrone) problem is the most famous of its kind (and hence it is investigated 
systematically in almost all variational calculus texts) there are many other similar time optimization 
problems. Some of these problems (such as river crossing) can be found in the textbooks and research 
papers related to the calculus of variations. 


Problems 


1. Solve the brachistochrone problem (as described above in the text). 

Answer: Let solve this problem using the setting of Figure 15 where point A is at the origin of an 
inverted Cartesian coordinate system (i.e. its y axis is pointing downward) in the vertical plane. The 
bead is initially at rest and hence its kinetic energy is zero (therefore any energy of the bead should be 
potential). Now, as the bead is released and it starts sliding it loses potential energy by descending into 
the potential well of the Earth and hence the lost potential energy will be converted to kinetic energy 
according to the conservation of energy principle (or the work-energy principle). The magnitude of 
the lost potential energy is given by mgy (where m is the mass of the bead, g is the magnitude of the 
gravitational field and y is the vertical coordinate of the bead noting that point A is at the origin of 
coordinates), and hence by the work-energy principle we have (with v being the speed of the bead): 


29 
—_— = 2g (v = ds/dt) 


e 
ll 
> 3g) 8 
& < 
ec; ec 


V2gy 
a = VityPae fas = y(ae)* + (ay = VIF ved] Ua 


V2gy 


[48] Tf should be obvious that the starting point is where the bead starts descending (i.e. where it is initially at rest). 
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tB zB /1 12 
i dt = ii Veta dx 
0 0 V2gy 


1 cB /1 12 
tp = — | ne ae (g is constant) (48) 
V2g Jo VV 


Now, since we are supposed to minimize the time tg then the integral of the last equation represents 
our functional integral I[y]. So, on comparing the last equation to Eq. 1 we can see that in this case 


F= aa (noting that 1/\/2g is a constant and hence it is no more than a scaling factor). On 


applying the Euler-Lagrange equation (noting that F’ has no explicit dependency on « and hence we 
can use Eq. 3) we get: 


y 
vv oy \ Vy 
vity? yl dy! 
VY V¥ 2/1 + y? 
/1 + y!2 y’? 
vy Vyvity? 
1 


C 
C 
C 
—__———— C 
JV 1l+y? 
VyV1l+y? = D (D = 1/C) 
y(1+y") = D? 
D? 
ees 
¥ 


y” = 


dz = Day 


y 
py 


i 
YB 
y 
tp = dy 
j DEY 
-{ 


dy (49) 


where the last equation is justified by the fact that the curve passes through the origin of coordinates 
and hence xg = x and yg = y,.!*9 

To integrate Eq. 49 we use the substitution y = D? sin? 9 and hence dy = 2D? sin @ cos 6d0. Therefore, 
Eq. 49 becomes: 


D? sin? 0 


oe 2D? sin 6 cos 6 dé 
D? — D? sin* 6 


[49] The use of y as a variable and as a limit of integration should be noticed. This abuse of notation should be tolerated 
here to avoid unwanted complications. 
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Figure 16: A simple sketch representing a cycloid I traced by a point P on the rim of a rolling wheel W. 
See Problem 1 of § 2.2. 


0 - 2 
0 
t= i a 2D? sin 6 cos .d0 
0 1 — sin“ 0 


6 A) 
sin* 6 
Lr = aa 2D? sin @ cos 6 dé 
0 cos? 6 
Oni 
sin 6 
c= i a 2D? sin 8 cos 6 dé 
9 cosé 


0 
c= p | 2sin? 6d0 
0 


6 
te = p | (1 — cos 20) d0 [using the identity sin? @ = (1 — cos 26) /2] 
0 
r= PP G —) 
2 
2 
a — (26 — sin 20) 


Now, if we use a = D?/2 and ¢ = 20 we get: 
x =a(¢—sind) and y = D’ sin? 6 = (D?/2) 2sin? 0 = a(1 —cos¢) 


where we again use the identity sin? @ = (1 — cos 20) /2. The last equation (which is a parametric 
equation of x and y in terms of a constant a and an angular parameter ¢) represents a cycloid. So, 
the required brachistochrone curve is a segment of a cycloid. 
Note: the cycloid is a curve traced by a point on the rim of a circular wheel (restricted to a plane) as 
the wheel rolls on a straight line without slipping (see Figure 16; also see Problem 8). We should also 
note that the optimal time in the brachistochrone problem is a minimum (rather than a maximum) 
because the descent time along other curves (some of which will be investigated later) is longer. 

2. Re-solve the brachistochrone problem (i.e. Problem 1) but this time assume that the bead is not 
initially at rest. 
Answer: Because the bead in this Problem is not initially at rest, it should have an initial speed v;. 
On repeating the analysis of Problem 1, we should again be led to using the work-energy principle 
(with v being the actual speed of the bead) but with an added term, that is: 


1 
=mvu. — =mvz; = mgy 
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Figure 17: A simple sketch representing a wire [ connecting a fixed point A to a given curve [, with a 
bead D sliding frictionlessly along the wire under the effect of gravity alone. The bead is released from 
rest at point A which is at the origin of an inverted Cartesian coordinate system. See Problem 3 of § 2.2. 


v = 4/2gy+v? 
ds 

— = 4/%qy+ v2 
dt eases 


d. 
ie ee, ° ee 
JV 2gy + v7 
a/1 12 ¢ 
fe és, EEO 
V2gy + 0? 
tB LB 1 12 
| dt = Basen Der 
0 0 6/2gy+ v7 
1 2B /L+y" 
tB => — dx 
V2g Jo \/y + (v7 /2g9) 


1 i vit¥? , 
— no 
V2g Jo VY 


tB 


where Y is defined by the coordinate transformation Y = y + ea (and hence y’ = Y’). As we see, the 
last equation is identical in form to Eq. 48 in Problem 1 and hence the solution of this Problem is also 
a cycloid. 

3. Re-solve the problem of fastest descent (i.e. the brachistochrone problem) but assume this time that 
the bead is sliding from a fixed point A towards a given curve [; (see Figure 17). 
Answer: In this Problem we are required to find the curve I’ for which the bead (which is released 
from rest at a fixed point A) reaches the curve I’; in the least possible time. So, this problem is of the 
type that we investigated in § 1.9 because the curve I’, represents a variable boundary (since the end 
point which is free to move along this curve is not fixed). 
We use in our solution the same setting as in Problem 1 (see Figure 17). If we follow similar analysis 
and formulation to the analysis and formulation of Problem 1, then it should be clear that F' again is 
VV 


given by F = and hence the solution here should also be a cycloid connecting the fixed point A 


2.2 Fastest Descent Curves 86 


to a yet-unknown point B on the given curve I}. However, we should have an additional transversality 
condition which is given by Eq. 23, that is: 


(l+y’) 
aa i Vaya 5 
y(1+y’?) y+?) 
dX + y!*dX +y/dY —y?dX 0 
gL yl?) 
aX +y'dY - 
y(lt+y?) 
dX+y'dY = 0 (50) 
, _ aX 
ay 
dy dX 
dx ~~ «dY¥ 
dy dY =! 1 
dx dX 


The last equation means (noting that in the transversality condition y’ belongs to T at B and dX 
and dY belong to T; at B) that the two curves T and T; are perpendicular at B.4° So in brief, the 
solution is a cycloid curve T' connecting the fixed point A to the curve [; at a point B on T'; such that 
the curves [ and I; are perpendicular to each other at point B./>4] 

Note 1: if I’, is a vertical line then the perpendicularity condition requires the slope of T at the point 
of contact to be zero (i.e. the tangent to I at point B is horizontal). This can be concluded from 
Eq. 50 (without going through the subsequent steps) because if T' is a vertical line then dX = 0 (and 
dY #0) and hence Eq. 50 becomes y/dY = 0 which leads to y’ = 0 (since dY # 0). 

Note 2: if I’; is a horizontal line (assuming this case is allowed in the formulation of the brachistochrone 
problem or it is treated as a separate case) then there are many details and situations to be considered 
(e.g. whether the point of contact between [ and T is allowed to be vertically beneath the point of 
release or not). In fact, some of these details should be subject to deliberation and research. However, 
the simplest situation is when the point of contact is allowed to be beneath the point of release in 
which case the obvious solution is a vertical straight line (i.e. free fall) where this solution can be seen 
as a limiting case to the cycloid solution (noting that even this solution meets the perpendicularity 
condition). Also, see Problem 10. 

4. Re-solve the problem of fastest descent (i.e. the brachistochrone problem) but assume this time that 
the bead is sliding from a given curve I’; towards a fixed point B (see the upper frame of Figure 18). 
Answer: For technical reasons (which are fully explained in the literature; see for instance Weinstock 
in the References), this Problem cannot be solved like Problem 3 by simple application of the above 
method (with the imposition of the additional transversality condition). However, instead of going 
through detailed technical analysis and formulation we can obtain the result (which is obtained tech- 
nically in the literature) by a simple argument, as explained in the following paragraph. 

Let the force of gravity become repulsive (instead of being attractive) and hence we are looking for 


[59] Tt should be known from elementary calculus that when two curves are perpendicular then the product of their slopes at 
the intersection point is —1. 

[51] This should be understood to set the optimization condition for a generic form of [4. Whether this condition can or 
cannot be met by a certain form of I’; (defined by certain conditions and restrictions according to the setting of a specific 
problem ) is a different story. 
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ry 


Figure 18: The three frames that demonstrate the setting and reasoning of Problem 4 of § 2.2. The dashed 
lines in the lower frame are the tangent to I, at the upper point A and the tangent to I at the lower 
point B. 
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Figure 19: A simple sketch representing the setting and reasoning of Problem 5 of § 2.2. 


the curve of fastest ascent (which we also label with I for simplicity). Accordingly, this Problem will 
reduce to the setting and formulation of Problem 3 where the bead is sliding from a fixed (lower) point 
towards (a higher point on) a given curve [; (see the middle frame of Figure 18) along a curve I where 
this curve (i.e. T’) is also a cycloid but it is now concaving downward. So, according to the result of 
Problem 3 the curve [ (in the middle frame of Figure 18) should meet the curve Ty at a point where 
their tangents are perpendicular. Hence, we obtained the shape of the brachistochrone curve (and in 
fact the solution) but for the setting of repulsive gravity. Now, let us return to the attractive gravity. 
In fact, all we need to do now is to rotate the curve [ through a (i.e. 180°) to match the setting and 
effect of attractive gravity. Accordingly, the solution of the present Problem is a cycloid (concaving 
upward) whose tangent at the lower point is perpendicular to the tangent of [, at the upper point 
(see the lower frame of Figure 18). 

5. Re-solve the problem of fastest descent (i.e. the brachistochrone problem) but assume this time that 
the bead is sliding from a given curve I’; towards a given curve [2 (see Figure 19). 

Answer: If we follow a similar argument to that used in Problem 4 then we can say that the solution 
of the present Problem can be obtained by combining the results of Problem 3 and Problem 4. In 
brief, the curve T is a (segment of) given cycloid that connects a point A on T; to a point B on 2. So, 
from the point of view of connecting point A to curve I’, the perpendicularity condition of Problem 3 
applies, while from the point of view of connecting curve [, to point B the perpendicularity condition 
of Problem 4 applies. Hence, the solution should be a cycloid that connects points A and B of equal 
slope (i.e. the slope of [; at point A is equal to the slope of Iz at point B). 

Note: in fact, there are many details and considerations that should be taken into account in tackling 
this Problem and establishing its rationale and formulation. So, the above argument is primitive and 
rough and should be treated as a pedagogical exercise. 

6. Referring to the setting of Problem 1, a bead on a cycloid (i.e. cycloid-shaped wire) given by x = 
a(¢—sin¢) and y = a(1 —cos@) started descending from point (0,0) at a given time and reached the 
lowest point on the cycloid at a later time. Find the time required for the bead to descend from point 
(11, y1) to point (x2, y2) where 0 < x1 < x2 < ar. 
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Answer: We have: 


, dy asin ddd sind 


dz = a(1-—cos¢) dd dy = asin ddd Ui area ad Eco 


Now, let ¢; and tg correspond to points (x1, yi) and (x2, y2) respectively (and similarly for ¢, and ¢2) 
and the time required for the bead to descend from point (x1, yi) to point (x2, y2) be T = te — th. 
Starting from Eq. 47, we have: 


a(1—cos¢) d¢ 


2ga(1—cos¢) 
gb2 (1- sin? 
tg-t) = i /' “el ety cos b) dp 
. 2ga (1 — cos ¢)* 
g2 ae 3 
Be: i 1 — 2cos + cos? $ + sin” Caliendo 
7 2ga (1 —cos¢)° 


$2 2—2cos@ 
= d 
T i yal esee a(1— cos ¢)d¢ 


2 12a? (1 —cos¢)* 
r =f” [o=eaal 
¢ 2ga(1— cos ¢) 


re [fe 
pe ‘E Cm (51) 


Note: as we see, the last equation means that the time of descent between two given points on this 
curve (assuming the motion started at the origin) is proportional to the angular displacement (¢2— 1). 
For example, the time of descent from the point corresponding to ¢; = 0 to the point corresponding 
to dy = 7/2 is the same as the time of descent from the point corresponding to ¢, = 7/2 to the point 
corresponding to ¢2 = 7. Now, if we note that the angular displacement is proportional to the x 
coordinate of the point of contact of the “wheel” that generates the cycloid (since x, = ad where ry 
is the x coordinate of the point of contact of the “wheel”) we can imagine the movement of the bead 
as if it is the result of being attached to the rim of a wheel that moves uniformly (i.e. with constant 
speed) on the & axis. 
7. Show that the brachistochrone curve is also a tautochrone (or isochrone) curve. 

Answer: In this answer we again use the setting of Problem 1. Our objective here is to show that the 
time taken by a bead to descend to the lowest point (21, y) = (a7, 2a) on the curve is independent of 
the position of the point of release (from rest) on the curve. Let note first that according to Eq. 51 
the time of descent of the bead to the lowest point (i.e. the point corresponding to ¢2 = 7) on the 
brachistochrone assuming the bead is released (from rest) at the origin of coordinates (i.e. the point 
corresponding to ¢; = 0) is 7,/a/g. Now, let assume that instead of releasing the bead (from rest) 
at the origin of coordinates the bead is released (from rest) at a lower point on the cycloid [say point 


dp 
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(as,Ys) where 0 <a, <a and0< y, < yi] and hence Eq. 47 becomes: 


2. 
fay VY1t+y’ d 
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Figure 20: A simple sketch representing a cycloid [ generated by a point P on the rim of a rolling wheel 
W of radius a. As can be seen, the x and y coordinates of the point P during the rolling of the wheel are 
tp = ad —asing and yp = a—acos®@. See Problem 8 of § 2.2. 


a a 2/8 [0+ 5) 
g 2 


ft, = ™/— 
g 
So, according to this equation the time of descent (i.e. t; —t,) to the lowest point is independent of the 
position of the release point. In fact, this time is the same as the time of descent if the bead is released 
from the origin of coordinates (as we noted earlier) which is logical since the release from the origin of 
coordinates is just a special case of a (general) release point. So, we conclude that the brachistochrone 
curve is also a tautochrone (or isochrone) curve, as required. 
8. Show that the cycloid is a curve traced by a point on the rim of a circular wheel (restricted to a plane) 
as the wheel rolls on a straight line without slipping. 
Answer: From the construction of Figure 20 (where the wheel W of radius r = a generates the cycloid 
I) we can easily see that the z and y coordinates of point P (i.e. the point that generates the cycloid) 
are: 


tp = ad—asind=a(¢d-—sing) 


yp = a-—acos¢=a(l—cos¢) 


We note that the signs of the trigonometric functions over the entire range of ¢ (which is of fundamental 
cycle 0 < ¢ < 27) take care of all the possibilities. 

9. Using the setting of Problem 1, find the equation of the curve of fastest descent between the points 
(0,0) and (1,1) and plot it. 
Answer: From the result of Problem 1, the curve of fastest descent is a cycloid given by the parametric 
equations z = a(¢—sin@) and y = a(1—cosd¢). Now, the point (1,1) should satisfy these equations 
and hence we have: 


a(@—sing) =1 and a(1—cos¢) =1 (52) 
On comparing these equations we get: 


a(g—sin¢d) = a(1—cos¢) 
o-—sing = 1l-cos¢ 
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On solving this equation (numerically or analytically) we get ¢ ~ 2.412011144. On inserting this value 
of ¢ into one of the equations of Eq. 52 we get a ~ 0.572917037. So, the curve of fastest descent that 
connects the points (0,0) and (1,1) is given by the parametric equations: 


x = 0.572917037 (¢ — sin 4) and y = 0.572917037 (1 — cos ¢) 


where 0 < ¢@ < 2m for a complete cycle. This curve is plotted (for the range 0 < ¢ < 27) in Figure 21. 
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Figure 21: Plot of the curve of fastest descent that connects the points (0,0) and (1,1) for the range 
0 < @ < 2x where the point (1,1) is marked. See Problem 9 of § 2.2. 


10. Find the time of descent from point (0,0) to the lowest point on the curve of Problem 9 (assuming 
SI units). Also, compare the time of descent of this Problem to the time of free fall and the time of 
descent on the straight line that connects the two points, i.e. point (0,0) and the lowest point. 
Answer: Regarding the time of descent from point (0,0) to the lowest point on the curve of Problem 
9, we use Eq. 51 where the point (0,0) corresponds to ¢; = 0 while the lowest point corresponds to 


ob2 = 7, that is: 
57291 
= i oe =i) =4] oe ae (x — 0) ~ 0.7596 s 


The time of free fall (i.e. vertically from y, = 0 to y, = 2a) is 


/2 2 
nm) = ye Ca 3 = n=/e (2 x 0.572917037) ~ 0.4836 s 


Regarding the time of descent on the straight line that connects the two points, we use Eq. 47 (noting 


that for straight line the slope y’ is constant which in our case is y’ = Ru = 24 = 2 andy =y/x = 22) 


Ax ant 
and hence we have: 
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4g 
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0.572917037 x 1? (1+ 4) 
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l2 


0.90046 s 


As we see, the shortest is the free fall time, followed by the time of descent on the brachistochrone 
curve, followed by the time of descent on the straight line. 

Summarize the main properties of the brachistochrone curve that we obtained in this section. 
Answer: We note the following: 

e It is the curve of fastest descent between two points (within the stated conditions and considering 
variant cases). This property is a “consequence” of its characteristic as a brachistochrone (see the text 
and Problem 1). 

e The time of descent on this curve is proportional to the angular displacement (see Problem 6). 

e The brachistochrone curve is also a tautochrone (i.e. same time) or isochrone (i.e. equal time) curve 
(see Problem 7). 


2.3. The Catenary 


The objective in the problem of catenary (which originates from a Latin word meaning “chain”) is to find 
the shape of the curve I that an inextensible (but flexible) hanging cable or chain (of uniform linear mass 
density and supported only at its two ends in fixed positions A and B) will take when it is under the 
influence of (uniform) gravity alone (see Figure 22). As we will show, the catenary has the shape of a 
hyperbolic cosine curve. In fact, the problem of catenary can be solved by several methods and techniques 
(including ordinary calculus) as we will see in the Problems.|>?! 


Problems 


1: 


Solve the problem of catenary as a variational problem (by using the calculus of variations). 
Answer: We use a Cartesian coordinate system where the chain is in the xy plane (see Figure 22). In 
its equilibrium position the chain should take the shape that minimizes its potential energy (where its 
center of gravity is in the lowest position). Now, the potential energy dU of an infinitesimal segment 
of length ds and mass dm is dU = gydm = gypids where g is the magnitude of the gravitational field, 
y is the height of the segment (say above the ground if we take the ground level as the zero-energy 
reference level) and yu is the linear mass density of the chain. The potential energy U of the chain is 
the sum of the potential energies dU of the infinitesimal segments of the chain and hence it is given 
by the following integral (which is the functional that we intend to extremize in this Problem): 


u=f[ w= | oynas=su [ yds = gu | yVl+y? dz = Ty] (53) 
T T T x 


A 


Accordingly, F (x, y,y’) = yV1l+y”. Since F has no explicit dependency on x we use Eq. 3 (i.e. 


[52] We note that many (if not most) problems in physics and mathematics (including the calculus of variations) can be solved 


by various methods and techniques and hence the problem of catenary is not an exception in this regard. However, the 
catenary problem is used here as a case study (due to its simplicity and wide use) for demonstration and practice. Some 
(few) other Problems in this book are also solved by different methods and techniques for the same reasons. 
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Figure 22: A schematic illustration of the setting of the problem of catenary where a chain [' of uniform 
linear density is hanging freely from its two fixed ends (A and B) in a uniform gravitational field g. See 
Problem 1 of § 2.3. 
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Beltrami identity), that is: 
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(y/C)" -1 
x = Carccosh (4) +D (D is constant) (54) 


On solving the last equation for y we get the profile of the hanging chain, that is: 
x—-D 
= C cosh | —~— 55 
y = Ceosh (77 ) (55) 


This curve (which has the shape of a hyperbolic cosine) is called catenary. The two constants in this 
solution (ie. C and D) can be determined from the two boundary conditions at the two end points of 
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the curve (see Problem 4). 
Note 1: this Problem may be solved rather differently by using the other form of ds (i.e. ds = 


V1+4 27 dy), that is: 
YB 
u=[ w= | oynas=su f yds = gu | yV1+ 2 dy = Iz] (56) 
iB r r y 


A 


where the prime means d/dy (see footnote [42] ). Accordingly, F (y,z,2') = yV1+ 2’. Now, if we use 
Eq. 2 (with exchange of x and y) then we have: 
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which is the same as Eq. 54 and hence it leads to Eq. 55. 
Note 2: it is obvious that the optimal in this Problem is a minimum (as required by the physics) and 
not a maximum because other configurations of the chain will increase the potential energy. 

2. Re-solve the problem of catenary as a variational problem with constraint (see § 1.8) where the con- 
straint comes from fixing the length of the cable. 
Answer: As before (see Problem 1), we have F = y,\/1+ y’? but we have an added condition which 


is the constraint [ Gdax = [ \/1+ y? dx and hence: 


H(a,y,y') =F +AG=yV1+y2+aAV1+y2 = (y+) V1 ty? 


Now, by a suitable transformation!®?! we can bring H to the form Y/1 + Y” which is the same as the 
form of F' in the catenary problem without constraint (see Eq. 53). Therefore, the solution should also 
be a hyperbolic cosine (or catenary) as in the case of having no constraint (i.e. Problem 1). In fact, 
this should be obvious because there is no physical difference between the two cases; the difference is 
only in the formulation which should not affect the nature of the solution. 

3. Re-solve the problem of catenary using this time ordinary calculus (instead of the calculus of variations). 
Answer: Referring to Figure 23, a tiny element of the cable should be pulled down by the force of its 
weight Fi, = gus where g is the magnitude of the gravitational field, y is the linear mass density of the 
cable and s is the length of the element. This element should also be pulled horizontally (to the left 
according to the setting of Figure 23) by a horizontal force F), (where for clarity this force is shown at 
the lowest point of the cable noting that it is independent of x). Now, this element is in equilibrium 
and hence there should be a tension force F, whose horizontal component |F,| cos ¢ balances F}, while 
its vertical component |F,|sin ¢@ balances Fy, that is: 


IF;|cosé = Fh 


[53] Tf y= Y — \ then y’ = Y’ and hence H = YV1+4+Y”. 
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Figure 23: Illustration of the setting of the catenary problem (for the solution by ordinary calculus). See 
Problem 3 of § 2.3. 
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where C = an and D = ik E . This is the same as Eq. 55 and hence the solution from ordinary 


calculus is the same as the solution from the calculus of variations (i.e. hyperbolic cosine or catenary). 
4, Given that the catenary of Problem 1 passes through the boundary points (1,1) and (2,2), find the 
equation of this catenary and plot it. 
Answer: We solve Eq. 55 (or Eq. 54) for D, that is: 
D=z2ax —Carccosh (4) 
C 


On substituting the coordinates of the two boundary points into this equation we get: 


D 


1 
1-C h( = 57 
arccos ( z) (57) 


D 


2 
2-C h | — 58 
arccos ( =) (58) 
On subtracting Eq. 57 from Eq. 58 we get: 


2 1 
1 — Carccosh (z) + Carccosh (z) =0 


On solving this equation for C (using a numerical solver) we get C ~ 0.949988827 and hence D ~ 
0.693083213 (where we use Eq. 57 or Eq. 58 to find D). Accordingly, the equation of this catenary is: 


x — 0.693083213 


y ~ 0.949988827 cosh ( 0.949988827 


) ~ 0.949988827 cosh (1.052643959a — 0.729569857) 


The catenary is plotted in Figure 24. 


2 


Figure 24: Plot of the (segment of the) catenary that passes through the boundary points (1,1) and (2, 2). 
See Problem 4 of § 2.3. 
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2.4 Isoperimetric Problems 


Isoperimetric means “having equal perimeter” and hence these problems primarily deal with situations 
where we have a perimeter of fixed length that can enclose planar surfaces of different shapes and of 
different areas. Some authors define isoperimetric problems in a more general way (see for instance Byerly 
book in the References). In this book we classify under this type of variational problems those problems 
in which a correlation between length and area is presumed where one imposes a constraint on the other. 
Anyway, these various classifications, definitions and conventions have very little significance since they 
have no impact on the mathematics and hence we should not be distracted by these minor issues. 

In fact, this type of variational problems includes some of the oldest problems in the history of variational 
thinking and they actually precede the birth of the modern calculus of variations in the 17'” century. The 
prominent representative of the ancient isoperimetric problems is the Dido problem (which originates from 
a story in the ancient Greek mythology) where the challenge is to find the planar geometric shape with 
a perimeter of a given length that encloses maximum area (e.g. what is the shape of a 100 m fence that 
encloses a flat wheat field of maximum area). In the following we investigate a number of isoperimetric 
problems (some of which are so simple that they can be solved using ordinary calculus). We should finally 
note that the classification of isoperimetric problems under the optimal curves (which is the title of the 
present chapter) may not be ideal but it is of little significance (as indicated earlier)./4! 


Problems 


1. Find the shape of a planar open curve I of a given length / that encloses maximum area between it 
and a straight line passing through its end points A(a4, ya) and B(xg, yg).°! 
Answer: Let the straight line be the x axis and let parameterize the curve by a natural parameter s 
representing arc length (see Figure 25). Now, the enclosed area (which we want to maximize) is given 


by the integral: 
xB SB 
a= | do= [ yar = | yV1—y?ds=I 
LA SA 
where in step 3 we used dx = \/1 — y’? ds because (ds)* = (dx)? +(dy)’and hence dx = 4/ (ds)* — (dy)? = 


V1-—y’ ds (with the prime representing d/ds). Accordingly, F(s,y,y’) = yV1—y?, and since it is 
independent of s we use Eq. 3 (noting the correspondence between s and x and the significance of the 
prime), that is: 


re) 
(y 1 y?) Y oy (y 1 y”) = C 
—2y/’ 
1—y?—y' mars. 
yVl-y aC 2] 
2 
12 yy! — 
y ' oro} 1—y! Sg 
ieee 12 + 12 
y(1—y?) + yy? s 
1 y’? 
y = CyV1-y? 


[54] Ty fact, most of the Problems in the present section are about curves of given length that enclose or border optimal areas 
although in some Problems we investigated the opposite (i.e. given areas enclosed by curves of optimal length) which 
may be more appropriate for the classification and inclusion of this section in the “Optimal Curves” chapter. 

[55] In this Problem the two end points of the curve are not fixed, i.e. the Problem is based on assuming a loose curve whose 
end points are free to move on a straight line such that the enclosed area is maximum. In fact, fixing one point and 
leaving the other free should be sufficient. 
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Figure 25: A simple sketch demonstrating a planar curve [ of a given length / that encloses maximum 
area a between it and the x axis. See Problem 1 of § 2.4. 


ds C 
+d 
= =. gg 
yt- (8) 
+C arcsin (4) = s+D 
C 
s+D 
= itCsi a 
y sin ( G ) 
Now, at point A we have s = 0 and y = 0 and at point B we have s = / and y = 0, that is: 
D 1+D 
0=2tC sin (2) and 0=4C sin (47) 
that is: 
D 1+D l 
eam and oe ee + mr = nt (m and n are integers) 
So, C= mye and D=Cma = car = ny: Now, to simplify the solution we make n —m = 1 


(because m and n are arbitrary) and make D = 0 (because it is just a shift in s and hence it is also 
arbitrary).°! We also note that the curve is above the x axis (and hence y is positive). Accordingly, 
the solution becomes: 
l . 7s 
y=-—sin() (0<s<1) 
T 


Now, we have: 


1-(4) = cos? (4) [(as)? = (de)? + (dy)? 


[56] Tn fact, this may be seen as choosing a specific solution more than a simplification although this should not affect the 
generality of the final result. 
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Ny 
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l 
ds l 
tS cos (™) + EB (E is constant) 
T l 
eal TS 
z-E = +_ cos (“*) 
Therefore: 
l 
SPP aay? ef S 
(e—E)'+y - 


(@-E)’+y = 


( 
@- By +y = ( 
( 


1\2 
) (59) 
T 

which is an equation of a circle with center (F,0) and radius //7. Accordingly, the curve has the shape 
of a circular arc. In fact, the curve is a semi-circle of length | and center (E£, 0). 

Note: the optimal area in this Problem is a maximum (not a minimum) because the enclosed area 
can approach zero when the curve T approaches the straight line (and hence [I becomes effectively 
straight). The enclosed area can also approach zero in another extreme case when the two end points 
of the curve approach each other and the two sides of the curve also approach each other. This result 
will also be confirmed in Problem 2 (assuming the curve to be a circular arc which is the result that 
we obtained in the present Problem). We should also note that if the curve is shaped as a complete 
circle (i.e. with A and B being the same point) then its area (which is 0 = =) is half the area of the 
semi-circle (which is ¢ = ES, 

. Assuming that the curve in Problem 1 is a circular arc, show formally that it is a semi-circle.|>7] 
Answer: The area of a segment of a circle is given by: 


1 : 5 1 . PO. Poet sind 
o= 5 (9—sin@)r ait sin®) a = 5 F 7 
where in the second step we used r = 1/0 (since 1 = ré with r being the radius of the circle, @ being the 
central angle subtending the segment, and | being the length of the circular arc subtending 6). Now, 
to find the optimal value of o we take its derivative with respect to 6 (since it is the variable noting 
that I is fixed) and set it to zero to obtain the optimization condition, that is: 


2 do 
Ba 
1 cos @ sin 6 
62 62 +2 gs. 0 
6+6@cosé—2sind = 0 


The solution of this equation in the interval (0, 27] is 0 = 7, i.e. the segment of optimal area subtends 
an angle 9 = 7 which means that the circular arc is a semi-circle and the segment is a semi-disc. This 
result can also be confirmed graphically by plotting o as a function of @ (see Figure 26 where o is given 
in units of 17). 


[57] Noting the setting of Problem 1, Eq. 59 should be sufficient for this demonstration. So, the present Problem is about 


another way for the required demonstration. 
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Figure 26: Plot of o (in units of 1?) as a function of 6 where the peak of the curve at (7, 3¢) is marked. 
See Problem 2 of § 2.4. 


We should finally note that we can show formally that the optimal area in this Problem is a maximum 
(not a minimum) by employing the second derivative test, that is: 


da RD , 2008 | sing 6sin | 2cosé 
7 eC i ey 
2/2 cost sinz G6sint 2cosT 
- + + 
2 \7n3 73 ‘i a4 7 
[? 2 2 2 
— 0—0 
2 (2 73 = 3) 


2 
= —4 <0 
73 


Hence, the optimal area is a maximum. 
3. Given that the length of the curve in Problem 1 is | = 5, find the maximum enclosed area. 
Answer: As seen in Problems 1 and 2, the curve is a semi-circle of radius r = L and hence the 
maximum enclosed area is: 
(ae eae ee i? i 


25 
o= BAT — 5773 ae ae 3.97887 


4, Find the shape of a planar curve [ of a given length / that has maximum area beneath it (and above 
the x axis) with two fixed end points A and B (see Figure 27). 
Answer: We are supposed to maximize the area beneath the curve subject to the constraint that 
the length of the curve is equal to / and hence we use the Lagrange multipliers technique (see § 1.8). 
Now, the area is given by the integral i y dx while the length of the curve is given by the integral 
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Figure 27: A simple sketch demonstrating a planar curve I of a given length / (connecting two fixed points 
A and B) that has maximum area o beneath it. See Problem 4 of § 2.4. 


Loe 


iG (dx)” + (dy)? = fA? de: So, F = y and G = \/1+y” and hence H = F 4+ AG = 
y+A/1+y2. Using Eq. 22, we have:l® 


lI 
ro) 


(vei) - 2 (5 [yt avirl 


See NR SS NO 
| | I 
j=) 


/ 
1 d Ay _ 9 
dx ie yl? 
d Ay’ =) 
dx 14+ yl? 
/ 
Ay = r+C, 
/14+y/? 
My? = (@+C\)? (1+y”) 
y? [P-(@+O))] = @+C) 
ii _ r+C, (60) 
Me +O)? 
y = pee (on OP een Os 
(c+C\)?+(y-G)* = » (61) 


As we see, the last line is the equation of a circular arc with center (—C1, C2) and radius ||. 
Note 1: the three parameters C), C2, A in the above solution can be determined from the two boundary 
conditions (i.e. the coordinates of the end points A and B) plus the constraint on the length 1 (see 


[58] In fact, we can also us the Beltrami identity (Eq. 3) with H replacing F’. 
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Problem 5). 

Note 2: assuming the curve to concave downward the optimal area is a maximum (not a minimum) 
because the area can become a minimum when the curve concaves upward. In fact, the technicalities 
of the problem (supported by the associated conditions) should take care of these issues. 

Note 3: we are assuming certain restrictions on the length of the curve (i.e. limits on its minimum 
and maximum) for this Problem and its solution to apply. The details can be easily worked out in 
each particular problem. 

5. Gen that the curve in Problem 4 passes through the points (2,4) and (5,1) and the length of the arc 
is =, determine the unknown constants in the solution and hence obtain the specific solution. Also, 
piGt the curve and find the area beneath the arc. 

Answer: On inserting the coordinates of the points (2,4) and (5,1) into Eq. 61 we get: 


(2+0))?+(4-GQ,) = »# (62) 
G40) doy S32 (63) 


On subtracting Eq. 63 from Eq. 62 we get: 


(QO) +4 Oy 640) S1-Gy = ov 
4+ 4C, + CO? + 16 — 802 + C2? — 25-100, — C2? -14+20,-—C? = 0 
—6-—6C,—6C, = 0 

Cy = -1-C) (64) 


Also, by adding Eq. 63 to Eq. 62 we get: 


(2401)? + (4-G:)? + (54) +(1-Ce)? = 2)? 
4+4C, + C7 +16 —8C2 + CZ +254 100; +C7+1—2C,+CF = 2)? 
46 + 140, + 207 —10C2 +207? = 2 
23470, +C?2-50,+C7 = 
98 470, £0? 5 (1 -C)4+CIH ay = (from Eq. 64) 
28 + 7C; + OF +5450, +14+2C,+C? = 
29+14C,+2C? = » (65) 


Now, the length of the curve is given by | = is JV1+y? dx where y’ is given by Eq. 60, that is: 
RS fy (a + (tC)? ae 
(a + ca 
2 } 2 
30 : dN? = (x ae + (a + C1) de 
2 (a + Cy 
3m ee 
2 (a + Meee 
3m [y- 
Se 3s: z ——— d 
ay 
3 
> = iow (“5 . 
bal A |arcsin some) arcsin ame 
2 Xr r 
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37 
= = 29 + 14C; + 203 


: 5+ Cy ; 2+C, 
arcsin arcsin 
4/29 + 14C, + 2C? /29 + 14C, + 20? 


where in the last equation we substituted for \ from Eq. 65. On using a numerical solver we find 
C, = —2 and hence C2 = 1 (from Eq. 64) and \ = 3 (from Eq. 65). Therefore, the specific solution 
is (x — 2)? + (y—1)? = 9. The circle is plotted in Figure 28. Regarding the area beneath the arc, it 
is obvious that the central angle that subtends the arc |i.e. the angle between the points (2,4), (2, 1) 
and (5, 1)] is a right angle and hence the area of this sector is one quarter of the area of the circle, i.e. 
it is 2=. We should also add the area of the rectangle [whose vertices are the points (2,0), (5,0), (5,1) 
and (2, 1)] which is equal to 1 x 3 = 3. Therefore, the area beneath the arc is 0 = on +3 ~ 10.06858. 


Figure 28: Plot of the circle (x — 2)” + (y— 1)” = 9 where the area o under the circular arc is shown in 
gray. See Problem 5 of § 2.4. 


6. Find the shape of the closed plane curve of a given length that encloses maximum area. 
Answer: We use a 2D Cartesian system (in the plane of the curve) whose origin O is inside the curve 


(see Figure 29). Now, an infinitesimal sector OAB can be represented by a triangle OAB and hence 
its area is given by: 


1 1 1 
dr = 5 | de| = 5 |(0, 9,0) x (de, dy, 0)| = 5 (way ~ yd) 


Therefore, the area inside the curve is: 


o = fdo=§ 5 (edy— yar) =p 5 (ay -¥) ae 
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Figure 29: A simple sketch representing a closed planar curve I enclosing an area a. See Problem 6 of § 
2.4. 


Similarly, the length of the curve is given by the integral ¢ 4/ (dx)? + (dy)? = f /1t+y? dz. 


Now, we are required to maximize the area subject to the constraint that the length is constant. So, 
we use the Lagrange multipliers technique (see § 1.8) where H = F + AG = $ (zy! —y) +AV14+y?. 
Using the Euler-Lagrange equation (Eq. 22), we have: 


0 {1 


ay {2 nvr a (ay [2 + ayiry?| 


dx \ Oy’ |2 


l| 
o 


1 dfa,_ dy ania 
2° dx \2 l+y2} 
d {x 63 Ay’ ee: 
dx \ 2 l+y2}/ 2 
x dy’ _ 
tt ae ae +C 
Ay! = C-2 
Tab y’ — 
My? = (C—2) (1+y") 
My? —(C—a)’y? = (Cs) 
2 (C = a)? 
d2 — (C — 2)? 
y = (C 7 2) 
VX -(C- 2)? 
y = +\/d\-(C-2)?+D 
(C-2)’+(-D)y = »¥ 
(e—C)+(y-D) = »¥ 


This is an equation of a circle with center (C,D) and radius |X|. So, the shape of the curve that 
maximizes the enclosed area is a circle (noting that the solution cannot be a minimum because the 
area enclosed by a squeezed shape can approach zero). In fact, this solution is also intuitive and hence 
it was reached even by the ancient scholars using very simple arguments and reasoning. 
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Figure 30: A simple sketch demonstrating the setting of Problem 8 of § 2.4 where a planar curve [ of 
shortest length (connecting two fixed points A and B) encloses fixed area o between it and the x axis. 


7. Find the equation of the closed plane curve of length | ~ 4.18879 that encloses maximum area o and 
passes through the points (—0.5, —0.83333) and (0,—1.05904). Also find the enclosed area. 
Answer: From the answer of Problem 6 we know that this curve is a circle of radius 4 and circumference 
1. Accordingly, | = 4.18879 = 27 and hence » = 0.66667 = 2/3. Therefore, the equation of the curve 
becomes (a — C)? +(y— D)? = 4. On substituting the coordinates of the points (—0.5, —0.83333) and 
(0, —1.05904) into this equation we get: 


(0.5 CO)? (0.63338 =D)" 


Ole ols- 


C? + (—1.05904 — D)? 


On solving these equations simultaneously (e.g. by substitution from the second into the first) we get 
C = —0.5 and D = —1.5.59 So, the equation of the curve is (x + 0.5)” + (y + 1.5)" = 4 . The enclosed 
area is 0 = 7A? = a ~ 1.396263. 

8. Find the shape of the planar curve of shortest length that connects two given points [say point (x1, y1) 
and point (x2, y2)] such that the area beneath it (and above the x axis) is a given constant (see Figure 
30). 
ee We are required to minimize the arc length s of the curve I which is represented by the 
function y = y(x) subject to the area constraint o = a (where a is a given constant). Now, the 
length is given by s = f, ds = ieee J/1+y2dx = I, while the area is given by o = i ydx = Ip. 
So, on using the Lagrange multipliers technique (see § 1.8) with F = /1+y”? and G = y we have 
H=F+)AG=\/1+4+ y? +Ay. Hence, the Euler-Lagrange equation (noting that H is independent of 
x and hence we can use Eq. 3 with H replacing F’) is: 


H-y— = C 
[Er +90] a gy [VE + = 


yf Lye A = 
y y— Ia 


[59] Tn fact, there is another solution (i.e. C = 0 and D ~ —0.392373) which we do not consider here. 
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L+y?+rAyV1+y?—-y? = CV1+y? 
1l+rAyV1lty? = CV14+y? 
= 
Tty?2 = 
u wy C 
1 
ee 12 = 
: Qy— cy 
, 1 
= + =i 
‘ Oy — Cy 
a a => dx 
Gee 
Loe Oy 
) = £+D 
LOC) 
2 = (x + D)? 
1 Coy 
bo (0- 9) = eon 
: Gro. al 


As we see, this is an arc of a circle with center (—D,C/X) and radius |1/)|. 

Note: the constants CD, can be determined from the information about the two boundary points 
and the constraint ¢ = f°’ ydx = a (see Problem 9). 

. Given that the curve in Problem 8 passes through the points (— 2,0) and (V2, 0) and the area beneath 
it and above the x axis is 1.1416, determine the unknown constants in the solution and hence obtain 
the specific solution. Also, plot the curve and find the length of the arc. 

Answer: As we found in the answer of Problem 8, the curve is (an arc of) a circle and hence its 
equation is (# —c1)” + (y—c2)” = R? with (c1,c2) being the center and R being the radius. So, 
instead of determining C,D,. we determine c),c2,R and hence obtain the specific solution. Now, 
from the symmetry of the circular segment [as can be seen from the points (—V/2,0) and (V/2,0)] we 
know that the center of the circle is on the y axis and hence c, = 0. Thus, the equation of the circle 
becomes x? + (y — C2)” = R?. We also know from elementary geometry that the area o of a segment 
of a circle with radius R is given by: 


ee: 2R? arcsi : ly / R2 l\" 
OES resin | 55 ; 


where I is the length of the chord (i.e. the straight base of the segment) which in our case is equal 
to 2\V/2. On using a numerical solver (with ¢ = 1.1416 and | = 2/2) we find R = 2 and hence the 
equation of the circle becomes x? + (y — co)” = 4. We finally use one point [say (V2, 0)] to find c2, 
that is: 


2 
(v2) + (0-2)? =4 and hence e = —V2 


where we take the negative root for obvious geometric considerations. Therefore, the specific solution 
is 27+ (y + 2)" = 4. The circle is plotted in Figure 31. Regarding the length of the arc, it is obvious 
that the central angle that subtends the arc [i.e. the angle between the points (— 2,0), (0, -V2) and 
(/2, 0)] is a right angle and hence the arc is one quarter of the perimeter, i.e. it is 7. 
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Figure 31: Plot of the circle x? + (y + /2)’ = 4 where the segment of area o under the circular arc is 
shown in gray. See Problem 9 of § 2.4. 


10. Given that the curve in Problem 8 passes through the points (—1,5) and (4,0) and the area beneath 


it and above the & axis is 26 determine the unknown constants in the solution and hence obtain the 


specific solution. Also, plot the curve and find the length of the arc. 
Answer: As in Problem 9, the curve is a circle and its equation can be written as: 
(ea) +(y-@) =P (66) 


where (ci, C2) is the center and R is the radius. Now, from the points (—1,5) and (4,0) we get: 


(ise) 4 65) = (67) 
(4—c)?+(0—@)? = R? (68) 


On subtracting Eq. 68 from Eq. 67 we get: 


(-1-—c)?+(6-—@)? -(4-a)?-(0-—@)? = 0 
1+ 2¢, +2 +25 — 10 +4—-16+8¢ —c?-& = 0 
10+10c; —10c2 = O 

CG = qt (69) 


Also, on adding Eq. 68 to Eq. 67 we get: 
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14+ 2c +¢$+25-10e2+e¢4+16-8 +c7+c3 = 2R? 
21 — 3c, 4 ct 5c2 4 ca = R 
21-—8ce, + —5(e. +1) +(e. +1)? = FP? (substituting from Eq. 69) 
21 —3e +c) —5e, —S+e¢+2q +1 = FR 
2¢—6e +17 = R? (70) 


On solving Eq. 66 for y we get y = 4/ R? — (a — eye + 2 where cy and R? are given by Eqs. 69 and 
70 (in terms of c,). Accordingly: 


+4 
i. y dx 


, ae ce +o dx 


1 ; Lr-C 1 r-Cc a 
5? arcsin ( 2) 5 R(x a yt ( 7) + cox 


1 4— 1 Aa \* 
5B arcs ( qo) + 5RU ci) 4/1 ( z) + C94 


Qq 
II 


|S 

3 
lI 

o> 


|S 
ee 
3 
II 
a (| 
| 


1 £2 
5 arcsin ( 


25 1 4— 
di 5 (2c7 6c; 4 17) sei ( a - 


2c? — 6c, +17 


2 
1 4-c 
,/2c? — 6c, + 17(4-—e 1 tA(c; +1) — 
ey —— aie 


=a 
(2ci — 6c, + 17) arcsin a7 
Pier — 6c, +17 


2 
1 -l-q 
~4/2c? — 6c; + 17(-l—-e 1 (cy +1 
pee Se 7 —— oe 


where in the last equation we substituted for cp and R from Eqs. 69 and 70. On using a numerical 
solver we find cy = —1 and hence cz = 0 (from Eq. 69) and R = 5 (from Eq. 70). Therefore, the 
specific solution is (x +1)? + y? = 25. The circle is plotted in Figure 32. Regarding the length of 
the arc, it is obvious that the central angle that subtends the arc li-e. the angle between the points 
(—1,5), (—1,0) and (4, 0)] is a right angle and hence the arc is one quarter of the perimeter, i.e. it is 
5a 


l| 


Nl rR 


11. Find the shape of the closed plane curve of shortest length that encloses a given area. 
Answer: If we repeat the analysis of Problem 6 then we have H = \/1+y’? + A$ (xy! — y) where 
we shift the roles of F' and G in that Problem (because we are trying to optimize the length while 
imposing the area constraint). Accordingly, the Euler-Lagrange equation (Eq. 22) for this Problem is: 


0 a 1 ‘ d 0 7 1 ' 7 
By V1it+y + A5 (ay v| an (4 vity + Az (zy —y) = 0 
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Figure 32: Plot of the circle (a + 1)? + y? = 25 where the part of area o under the circular arc is shown 
in gray. See Problem 10 of § 2.4. 
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12. 


(2-£) +@-py = 5 


This is an equation of a circle with center (C/A, D) and radius |1/A|. So, the shape of the closed plane 
curve of shortest length that encloses a given area is a circle (noting that the solution cannot be a 
maximum because the length of the curve of a squeezed shape that encloses that area can diverge, e.g. 
a rectangle with one of its sides approaches zero). 

Note: the result of this Problem is intuitive and can be concluded from Problem 6 as a corollary using 
the proof by contradiction method!®! because if the closed plane curve of a given enclosed area (say 
oo) and shortest length (say po where p stands for perimeter) was not a circle then the circle of area oo 
should have a longer perimeter than po and hence the circle of perimeter po should have smaller area 
than oo and this contradicts the result of Problem 6 because the circle of perimeter pp should enclose 
maximum area. |°!1 

Find the equation of the closed plane curve of shortest length / that encloses an area o ~ 28.2743 and 
passes through the points (6,—1) and (3,2). Also, plot the result and find the length of the curve. 
Answer: From the answer of Problem 11 we know that this curve is a circle, so we can write the 
equation of the curve as (a — cr)” + (y— cy)” = R? where (Cx, Cy) is the center of the circle and R is 
its radius. Now, o = 28.2743 = 1R? and hence R = 3. Therefore, the equation of the curve becomes 
(x — cr)? + (y — cy)” =9. On substituting the points (6,—1) and (3,2) into this equation we get: 


(6—c,)?+(-l-g)? = 9 (71) 
(3-c,)°)+2-¢) = 9 (72) 
On subtracting Eq. 72 from Eq. 71 we get: 


(6 — cx)” + (-1— ey)” — (8c) —(2— gy)” = 
36 12c, +22 +142, +¢ 


4 - 9+ 6e, 2 4+ 4cy C= 
24— 6c, + 6c, = 
Cy = C,—4 (73) 


On substituting from this equation into Eq. 72 we get: 


(3 —cz)? + (6—cz)* = 

9- 6c, +2+4+36-12c,+c2 = 
c2 — 9cq + 18 

(cx — 3) (cz —6) = 


ono) 


Accordingly, c; = 3 or c, = 6 and hence c, = —1 or cy, = 2 (from Eq. 73). Hence, the equation of 
the circle is either (x — 3)” + (y +1)? = 9 or (a — 6)” + (y— 2)? =9 (ie. we actually have two circles 
that satisfy our requirements). The result is plotted in Figure 33. The length of the curve (i.e. the 
circumference of the circle) is 1 = 27 R = 62. 


[69] The proof by contradiction method should be linked in many cases to the “Principle of Reciprocity” which states (in one 


of its forms) that if y optimizes I; [y] subject to the condition that I2[y] is constant then y optimizes (usually in opposite 
sense) I2[y] subject to the condition that I;[y] is constant. However, this principle is not rigorous and hence it may be 
violated in some cases. We note that the use of [;[y] and I2[y] is a reference to the Lagrange multipliers formulation and 
symbolism (see § 1.8). 


161] For circle, o = mr? and p = 2nr (where o is its area, p is its perimeter, and r is its radius) which can be combined to 


2 
obtain o = r and hence the area increases/decreases as the perimeter increases/decreases (and vice versa). 
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Figure 33: Plot of the circles (x — 3)” + (y+ 1)” =9 and (x — 6)” + (y — 2)” = 9 of Problem 12 of § 2.4. 


Chapter 3 
Optimal Surfaces 


In this chapter we present and solve problems about topics and applications of the mathematics of variation 
related to optimal surfaces, i.e. we are looking in these problems to certain surfaces (or 2D objects) that 
optimize something (such as area). 


3.1 2D Planar Shapes of Optimal Perimeter 


From the title of this section, it is clear that in this type of problems it is required to optimize the perimeter 
of a given 2D planar shape (such as triangle or rectangle) subject to certain condition(s) such as being 
of fixed area. We note that due to the simplicity of this type of problems we use ordinary calculus 
(occasionally in association with the Lagrange multipliers technique) with no need for the variational 
formulation of the Euler-Lagrange equation. 


Problems 


1. What is the shape of the rectangle of fixed area and optimal perimeter?!®! 
Answer: If p is the perimeter of the rectangle, o is its area, L is its length and W is its width then 
we have p= 2(L + W) and o = LW. Therefore, W = and hence p = 2 (L + z), On differentiating 
p with respect to L and setting the derivative to zero (since it vanishes when p is optimal) we obtain 
the optimization condition, that is: 


# =2(1- 4) =0 and hence L= Jo and W=T=-ve 


Therefore, L = W = ,/o which means that the rectangle should be a square to optimize the perimeter. 
Note: it is obvious that the optimal perimeter in this Problem is a minimum (not a maximum) because 
the perimeter can diverge when the width of the rectangle approaches zero (noting that the area is 
fixed). 
2. Re-solve Problem 1 but this time use the Lagrange multipliers technique (see § 1.8). 

Answer: Let the length and width of the rectangle be LZ and W and hence we want to optimize 
f =2(L+4W) (which is the perimeter) subject to the constraint that the area LW is constant (and 
hence g = LW is constant). So, we should optimize h = f + Ag = 2(L4+ W) + ALW by taking the 
partial derivatives of h with respect to the variables L and W and setting the derivatives to zero to 
obtain the optimization conditions, that is: 


Oh 

Ap 2+AW =0 and hence W =-2/r 
OES ne ON a) dh L =-2/d 
aw + = and hence =— 


Accordingly, L = W which again means that the rectangle should be a square to optimize its perimeter. 
Note: again, the optimal perimeter in this Problem is a minimum (for the same reason). 

3. What is the shape of the triangle with fixed area and optimal perimeter?|®*! 
Answer: Referring to Figure 34 we have: 


[62] “Shape” in this context refers to a distinctive feature in the generic shape (which is rectangle) and hence this question 
can be posed as: what is the length to width ratio of the rectangle of fixed area and optimal perimeter? 

[63] As indicated earlier, “shape” in questions like this means a distinctive feature in the generic shape of the object (such as 
being isosceles or equilateral or having certain height to base ratio). 
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Figure 34: A schematic illustration of the setting of Problem 3 of § 3. 
to be optimized in its perimeter p=a+b+4+c. 


1 where a triangle of fixed area a is 


p = errr, ayer 


h 
og = sah =o 5 (e+ 9) 


where p is the perimeter, a,b,c are the lengths of the triangle sides, o is its area, h is its height, and 
x+y=a. Now, we are supposed to optimize p subject to the restriction on o (since it is fixed), and 


hence we use the Lagrange multipliers method (see § 1.8) with f 4 


t Ag = ptAod, that is: 


ptro = (a@+y) + Va? +h? + /y? +h? + 


AZ (e+) 


To optimize p+ Ao we need to take the partial derivatives of p+ Ao with respect to the variables z, y, h 


and set the derivatives to zero to obtain the optimization conditions, that is: 
6) x Ah 
—(p+aA =r] =0 74 
6) y xh 
— (n+) = =0 75 
5, + Aa) ae (75) 
O h h r 
= (ptAo) = x+y) =0 76 
gp PM) = et ee te ty) (76) 
On subtracting Eq. 75 from Eq. 74 we get: 
x y 4% 
Jenn wea 
zt = ¥ 
Vazt+h2 /y2 + p2 
we 7 y? 
y+ he y2 +h? 
a2 (v" 4 h?) = y" ie a h?) 
gy + ah? — oy? — yh? = 0 
i (e— v) =O 
ey = 0 (h £0) 
(tx-y)(e@+y) = 0 
x-y = 0 (c+y=aF0) 
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TS oy 
Therefore, the area becomes: 
h 
o=slety=s(t2)= 


while Eqs. 74-76 reduce to the following two equations: 


i x Ah 
T Fae RD a he T 2 
2h 
+ Ax 


(27) = xh 
= 0 
= 0 
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(77) 


(78) 


(79) 


On multiplying Eq. 78 with x and Eq. 79 with h/2 and subtracting the second from the first we get: 


x? h? 
' eae Vaz + h2 =o 
h? — x? 
© VaR 
2 (h? — a)” 
BE ar: Benge 
e+ath? = ht —2e7h? +24 
Ch She Oe he 
o = h*—20? 
30° = he 
h2 
7 WB 
_ oA 
ma Sg 


Accordingly, the length of the three sides are: 


2h 
e= ee cen 


2 
b = Va2r+h? = [Ean ae 77a 


(substituting from Eq. 77) 


(using Eq. 77) 


(Using Pythagoras) 


h? Ah? 
c = VPth=V22 +h? = Fan yes 


3 


2h 
—— (Using Pythagoras) 
v3 


So,a=b=c= a which means that our triangle is equilateral. 


Note 1: it should be obvious that the optimal perimeter in this Problem is a minimum (not a 
maximum) because for a triangle with a fixed area if the height (or the base) approaches zero the 


perimeter diverges. 


Note 2: although the above solution is based on the configuration of Figure 34 (which seems rather 
restricted) it can be easily adapted to include all possible configurations. 


Answer: The perimeter of ellipse is: 


. What is the shape of the ellipse with fixed area and optimal perimeter (or circumference)? 


n/2 
p= | dav 1 — e? sin? 6 dO 
0 
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where a is its semi-major axis while e is its eccentricity. Now, we are supposed to optimize p subject 
to the restriction on a (since it is fixed), and hence we can use the Lagrange multipliers method with 
f+Ag=p+Aae, that is: 


n/2 
prro= [ dav 1 — e? sin? 6 dé + ro 
0 


So, we need to optimize p+ Ao by taking its derivative with respect to e (since p+ Ac is a function of 
e) and setting the derivative to zero to obtain the optimization condition, that is: 


d 
ae, do) = 0 
Te (p+ Ac) 
m/2 ra) 
| wz (4aV1 — €? sin? 9) Ao = oO 
™/2  _Agesin? 6 


———. dé 
0 V1—e2sin? 6 


As we see, this integral can be zero only if e = 0 which means that the ellipse is a circle. So, the 
ellipse of fixed area and optimal perimeter is a circle (which is a special case of ellipse corresponding 
toe =0). 

Note 1: in differentiating the integral we use the fact that if € is a function of two variables (say t¢ 
and «) and 


B 
=f é (t,2) de 


(with a and £ being constants) then 


dot) ? OE (bya) 
/ DE dx 


Note 2: the optimal perimeter in this Problem is obviously a minimum (not a maximum) because for 
an ellipse of fixed area the perimeter diverges when the minor axis approaches zero (i.e. e approaches 
one). 


3.2. 2D Planar Shapes of Optimal Area 


From the title of this section, it is obvious that in this type of problems it is required to optimize the area 
of a given 2D planar shape (such as triangle or rectangle) subject to certain condition(s) such as being 
of fixed perimeter. As in the Problems of § 3.1, we use in solving these problems ordinary calculus only 
with no need for using the Euler-Lagrange variational formulation. 


Problems 


1. What is the shape of the rectangle of fixed perimeter p and optimal area? 
Answer: If we use the setting and labeling of Problem 1 of § 3.1 and follow a similar reasoning then 
we can say: in the present Problem we have o = LW and p= 2(L+W) and therefore W = (p/2)—L 
and hence o = L |[(p/2) — L]. On differentiating o with respect to L and setting the derivative to zero 
(since it vanishes when o is optimal) we obtain the optimization condition, that is: 

do 


Wi Fo 2L=0 and hence L mi and W 5 L eo ho 


Therefore, L = W = p/4 which means that the rectangle should be a square to optimize its area. 
Note: it is obvious that the optimal area in this Problem is a maximum (not a minimum) because the 
area can approach zero when the width of the rectangle approaches zero (noting that the perimeter is 
fixed). 


3.2 2D Planar Shapes of Optimal Area 117 


2. 


Re-solve Problem 1 but this time use the Lagrange multipliers technique (see § 1.8). 

Answer: If we use the setting and labeling of Problem 2 of § 3.1 and follow a similar reasoning then 
we can say: in the present Problem we want to optimize f = LW (which is the area) subject to the 
constraint that the perimeter 2(Z + W) is constant (and hence g = L+ W is constant). So, we should 
optimize h = f + Ag = LW + X(L + W) by taking the partial derivative of h with respect to the 
variables EL and W and setting the derivatives to zero to obtain the optimization conditions, that is: 


Oh 
at. = W+A=0 and hence W=-2A 
a = L+A=0 and hence L=-xX 


Accordingly, L = W which again means that the rectangle should be a square to optimize its area. 
Note: again, the optimal area in this Problem is a maximum (for the same reason). 


. Re-solve Problem 1 but this time use the proof by contradiction method. 


Answer: The result of Problem 1 (or Problem 2) can be obtained as a corollary from the result of 
Problem 1 of § 3.1 (or Problem 2 of § 3.1) because if the rectangle of the given perimeter (say po) and 
maximum area (say oo) was not a square then the square of perimeter po should have a smaller area 
than oo and hence the square of area do should have longer perimeter than po and this contradicts the 
result of Problem 1 of § 3.1 (or Problem 2 of § 3.1) because the square of area og should have minimum 
perimeter. |4I 

Note: the course of action in the present Problem can be reversed by using the result of Problem 1 of 
the present section (or Problem 2 of the present section) in conjunction with the proof by contradiction 
method to establish the result of Problem 1 of § 3.1 (or Problem 2 of § 3.1). In this case we would 
say: the result of Problem 1 of § 3.1 (or Problem 2 of § 3.1) can be obtained as a corollary from the 
result of Problem 1 of the present section (or Problem 2 of the present section) because if the rectangle 
of a given area (say oo) and minimum perimeter (say po) was not a square then the square of area 
oo should have a longer perimeter than po and hence the square of perimeter pp should have smaller 
area than oo and this contradicts the result of Problem 1 of the present section (or Problem 2 of the 
present section) because the square of perimeter po should have maximum area. [65] 


. What is the shape of the triangle with fixed perimeter and optimal area? 


Answer: It is equilateral. This is a corollary of the result of Problem 3 of § 3.1 because if the 
equilateral triangle with fixed perimeter (say po) is not optimal (i.e. maximum; see the upcoming 
note) in area then there should be a non-equilateral triangle with perimeter pp and with maximum 
area (say oo) and hence an equilateral triangle of area oo will have a longer perimeter than po which 
contradicts the result of Problem 3 of § 3.1 because the equilateral triangle of area o9 has minimum 
perimeter. |®¢ 

Note: it should be obvious that the optimal area in this Problem is a maximum (not a minimum) 
because for a triangle with a fixed perimeter if the height (or the base) approaches zero the area 
converges to zero. 


. What is the shape of the triangle of optimal area whose perimeter and one of its sides are fixed (i.e. 


only two of its sides can vary while keeping their sum fixed) .|°7] 
Answer: If we follow the method of Problem 3 of § 3.1 (assuming the fixed side to be a) then this 


64 


66 


67 


For square, p = 4x and o = x? (where p is its perimeter, o is its area, and x is the length of its sides) which can 
be combined to obtain p = 4,/o and hence the perimeter increases/decreases as the area increases/decreases (and vice 
versa). 


5] For square, o = x? and p = 4a (where a is its area, p is its perimeter, and x is the length of its sides) which can be 


combined to obtain ¢ = p?/16 and hence the area increases/decreases as the perimeter increases/decreases (and vice 
versa). 


V3 


For equilateral triangle, p = 3a and 0 = Ba? (where p is its perimeter, o is its area, and a is the length of its sides) 
which can be combined to obtain p = 6, / Ja and hence the perimeter increases/decreases as the area increases /decreases 


(and vice versa). 
We can also characterize this triangle as: the length of one of its sides and the sum of the lengths of its two other sides 
are fixed. 
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time we have f+ Ag =o +Ap = E(a@+y)+A(aty) + AVa? +h? + A/y? + h? (because here we 
want to optimize the area subject to a perimeter constraint) and hence: 


O h x 

—- —- —l ‘4. | — 
agree 2 a( jena) 

a oh y 5 
es = F+a(14+ tes) <0 


On subtracting the second equation from the first equation we get: 


x y =; 
Varth? fyr+h? — 


and hence x = y (as shown in Problem 3 of § 3.1) which means that the triangle is an isosceles. 
Note 1: it should be obvious that the optimal area in this Problem is a maximum (not a minimum) 
because for a triangle with a fixed perimeter (regardless of having a fixed side or not) if the height 
approaches zero the area converges to zero. 
Note 2: if the length of the fixed side is half the sum of the other two sides then the optimal triangle 
of this Problem becomes equilateral and this Problem becomes an instance of Problem 4. 

6. What is the shape of the ellipse with fixed perimeter and optimal area? 
Answer: It should be a circle. This is a corollary of the result of Problem 4 of § 3.1 because if the 
circle with the fixed perimeter (say po) is not optimal (i.e. maximum; see the upcoming note) in area 
then there should be an ellipse with perimeter po and with maximum area (say 09) and hence a circle 
of area og will have a longer perimeter than po which contradicts the result of Problem 4 of § 3.1 
because the circle of area og has minimum perimeter. 
Note: it should be obvious that the optimal area in this Problem is a maximum (not a minimum) 
because for an ellipse with a fixed perimeter if the minor axis approaches zero (i.e. e approaches one) 
the area converges to zero. 


3.3. 2D Planar Shapes Inside Other 2D Planar Shapes 


In this type of problems it is required to optimize a certain property (such as area) of a given 2D planar 
shape (such as triangle) that is confined inside another 2D planar shape (such as circle). Again, we 
use in solving these problems ordinary calculus only with no need for the variational formulation of the 
Euler-Lagrange equation. 


Problems 


1. What is the shape of the triangle of optimal perimeter inscribed inside a circle? 

Answer: Referring to Figure 35, we use a Cartesian coordinate system centered on the center of the 
circle. With no loss of generality, we put one of the triangle vertices (i.e. vertex A) on the positive x 
axis. Now, because the size of the radius of the circle does not affect the basic shape of the inscribed 
triangle (since the radius is just a scaling factor that scales the triangle up or down without affecting its 
features) we use (with no loss of generality) a unit circle. This is to simplify and reduce the required 
algebra. To formulate the Problem variationally we introduce two positive angles: @ (which is the 
positive angle AOB) and ¢ (which is the positive angle AOC). So, the setting of the Problem is as in 
Figure 35. 

Now, from Figure 35 the perimeter p of the triangle is given by: 


p = a+c+t+b 
va — cos 6)? + sin? 6 + va — cos ¢)2 + sin? ¢ + \/(cos 6 — cos ¢)? + (sin 6 — sin ¢)? 


I 


V1 —2cos@ + cos? 6 + sin? 6 + 1/1 2cos @ + cos? b + sin? + 


I 
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Figure 35: The setting of Problem 1 of § 3.3 where a triangle with vertices A(1,0), B(cos0,sin@) and 
C(cos ¢, sin ¢) is inscribed inside a unit circle centered on the origin of coordinates with a,b,c being the 
lengths of the sides of the triangle. 


(cos? @ — 2 cos cos @ + cos? ¢ + sin? @ — 2sin sing + sin? ¢ 
= v2 2cos 6 + \/2 2cos @ + \/2 — 2cos cos @ — 2sind sind 
= v2 [vi cos 8 V1 cos @ V1 cos cos G — sin 8 sin 9] 
= v2 | VI—cosd + 1 cos ¢ + y/1 — cos (8 @)| 


To optimize p we take its partial derivatives with respect to the variables 6 and ¢ and set the derivatives 
to zero to obtain the optimization conditions, that is: 


top _ 
/2 06 
sin 0 sin (0 — ¢) 
= 0 
2V1—cos@ 24/1 —cos (6 — 6) 
sin 0 - sin (6 — ¢) 
V1 —cosé = 1 — cos (0 — ¢) 
sin? 6 sin? (0 — ¢) 
1—cos6 1—cos(@— ¢) 
1—cos?@ — 1 -cos?(@-@¢) 
1 —cosd 7 1 —cos(@— ¢) 
1l+cos6é = 1+ cos(@- @) 
cos0 = cos(#—@) (80) 
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Wr . 4 
V2 0¢ 
sin b sin (@ — ¢) Be . 4h 
2/1—cosd 24/1 — cos (8 — ¢) 
sin b _ sin (0 — ¢) 
Vi—cosé 1 —cos (6 — ¢) 
sin? _ sin? (0 — ¢) 
l1—cosé  ~—« 1~cos(@— 4) 
l—cos*¢? = 1-cos?(9-¢) 
1—cos@ 7 1—cos (0 — ¢) 
l+cosé = 1+ cos(@- ¢) 
cos = cos(#—¢) (81) 


On combining Eqs. 80 and 81 we get cos? = cos¢ = cos(9—¢). Now, if we note that cosd = 
cos (27 — d) and cos(@ — ¢) = cos(¢@— 6) then we have cos@ = cos (2m — ¢) = cos(¢@— 6). Noting 
that 0 is the positive angle AOB, (27 — ¢) is the positive angle COA, and (¢— 9) is the positive angle 
BOC, we conclude (within the given constraints) that the angles AOB, BOC and COA are equal which 
means that the triangle is equilateral. 
Note: it should be obvious that the optimal perimeter in this Problem is a maximum (not a minimum) 
because the perimeter can converge to zero when the vertices of the triangle become too close to each 
other. |®8! 
. Calculate the length of the longest perimeter of a triangle inscribed in a circle of radius R. 
Answer: From the result of Problem 1, the inscribed triangle of longest perimeter is equilateral and 
hence the length is: 

p = 6Reos = =6Rx ae = 33k 
. What is the shape of the triangle of optimal area inscribed inside a circle? 
Answer: We refer to Figure 36 where the setting of the Problem is illustrated. Because the area o of 
an inscribed triangle is equal to the area of the circle minus the sum S of the areas of the segments 
(shaded gray in Figure 36) we will optimize (i.e. minimize) S rather than optimize (i.e. maximize) o. 
r2 
2 
angle. With no loss of generality we can use a circle of radius r = V2 (because the radius r is just a 
scaling factor that does not affect the basic shape of the inscribed triangle) and hence the formula of 
a, becomes 0, = a—sina. So, our task is to optimize S which is the sum of 01, 02, 03 (see Figure 36) 
that is: 


Now, the area o, of a segment of a circle of radius r is og = 4 (a —sina) where a is the subtended 


S = o,+02+03 
= [0-sin6] + [¢-sing] + [(27 — 6 — ¢) — sin(2x — 6 - 4)] 


Now, to optimize S' we take its partial derivatives with respect to 6 and ¢ and set the derivatives to 
zero to obtain the optimization conditions, that is: 


2s = 1-cos@—1-+ cos(27 — @— ¢) = —cos6 4+ cos(2m — 6 — ¢) =0 (82) 
- = 1—cos¢—1+cos(2m —6— ¢) = —cos¢+4+cos(27 — 6— d) = 0 (83) 


On subtracting Eq. 82 from Eq. 83 we get: 


cos9—cos@ = 0 


[68] In fact, the perimeter of the inscribed equilateral triangle (which is p = 3/3R with R being the radius of the circle) is 


greater than the perimeter in another limiting case when two vertices approach each other while the other vertex is on 
the other side of the circle (and hence the perimeter approaches p = 4R). 
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Figure 36: The setting of Problem 3 of § 3.3 where a triangle with vertices A, B and C is inscribed inside 
a circle of radius r = V2. 


cos@ = cosd (84) 
Also, On adding Eq. 82 and Eq. 83 we get: 
—cos6—cos¢+2cos(2x-A-—¢) = 0 
—2cos#+2cos(2a-0—¢) = 0 (substituting from Eq. 84) 
cosO = cos(2x — 6-9) (85) 


So, from Eqs. 84 and 85 we can see that (within the given restrictions) 0 = ¢ = 2x — 0 — ¢ which 
means that the three angles that determine the shape of the triangle are equal and hence the triangle 
is equilateral. 
Note 1: it is obvious that in this Problem the optimal area of the triangle is a maximum (not a 
minimum) because the area can converge to zero when the vertices of the triangle (or two of them) 
become too close to each other. 
Note 2: in our answer we optimized the sum S of the circle segments 01, 02, 03 instead of optimizing 
the area of the triangle directly (by optimizing the sum of the three inner triangles seen in Figure 
36) to avoid necessary (and rather complicated) justifications according to some possible settings and 
configurations of the problem. 

4. Calculate the area of the triangle of maximum area inscribed in a circle of radius R. 
Answer: From the result of Problem 3, the inscribed triangle of maximum area is equilateral and 
hence the area is: 


Se ean ahen = seeRan = = ove 
2 3. 2 2 4 
5. What is the shape of the rectangle of optimal perimeter inscribed inside a circle? 
Answer: We use a Cartesian coordinate system centered on the center of the circle with the sides of 
the rectangle being parallel to the axes (see Figure 37). Because the size of the radius of the circle does 
not affect the basic shape of the inscribed rectangle (since the radius is just a scale factor) we use (with 
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no loss of generality) a unit circle. Due to the symmetry, the perimeter p is given by p = 4(a+ 3) where 
a and £ are the coordinates of the vertex in the first quadrant. Hence, we are required to optimize 
a+ 6 (because 4 is no more than a scale factor). Now, the vertex (a, 3) is on the circle and hence it 
should satisfy the equation of the circle 2? + y? = 1, that is: 


Figure 37: The setting of Problem 5 of § 3.3 where a rectangle (with its sides being parallel to the 
coordinate axes) is inscribed inside a unit circle centered on the origin of coordinates with a and 8 being 
the « and y coordinates of the vertex in the first quadrant. 


ar 458" 1 
B = vV1- a? 


Accordingly, we are required to optimize a + V1 — a? by taking its derivative with respect to a and 
setting it to zero to obtain the optimization condition, that is: 


d 
=, (e+ 1— 0?) = 0 
igen. SG 
Vl—-at 
2 
ae 
1-—a? 
207 = 1 
1 


Oe = 


Therefore, 8 = /1—a? = wi and hence a = 6 = a which means that the rectangle is a square 
(noting that the square is a special case of rectangle in which the length and width are equal). 
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Note 1: according to the result of this Problem, the length of the longest perimeter of a rectangle 
inscribed in a circle of radius R is 4\/2R (where this rectangle is a square). 
Note 2: it is obvious that the optimal perimeter in this Problem is a maximum (not a minimum) 
because the perimeter can converge to 4R (i.e. 4 times the circle radius) when the width of the 
rectangle approaches zero. 

6. What is the shape of the rectangle of optimal area inscribed inside a circle? 
Answer: If we follow the setting and reasoning of Problem 5 then the area a of the rectangle is: 


a = 4a8 = 4avV1 — a? = 4a? — a4 


Accordingly, we are required to optimize Va? — a* by taking its derivative with respect to a and 
setting it to zero to obtain the optimization condition, that is: 


da 
2a — 4a _ 9 
2/a2 — at 
a—2a° = 0 
a(1—207) = 0 
1-20? = 0 (a #0) 
<2 
oS lp 
Therefore, 6 = V1 — a? = 7 and hence a = 6 = +5 which means that the rectangle is a square. 


Note 1: according to the result of this Problem, the maximum area of a rectangle inscribed in a circle 
of radius R is 2R? (where this rectangle is a square). 
Note 2: it is obvious that the optimal area in this Problem is a maximum (not a minimum) because 
the area can converge to zero when the width of the rectangle approaches zero. 

7. What is the shape of the rectangle of optimal perimeter inscribed inside an ellipse? Also, find the 
length of the optimal perimeter. 
Answer: Let use a Cartesian coordinate system centered on the center of the ellipse with the major 
axis of the ellipse being on the x axis and hence the sides of the rectangle being parallel to the coordinate 
axes (see Figure 38). Also, let the ellipse have semi-major axis a and semi-minor axis b. Due to the 
symmetry, the perimeter p of the rectangle is given by p = 4(a+ 8) where a and £ are the coordinates 
of the vertex in the first quadrant. Hence, we are required to optimize a + 3 (because 4 is no more 
than a scale factor). Now, the vertex (a, 3) is on the ellipse and hence it should satisfy the equation 


of the ellipse - + ue = 1, that is: 


a2 2 
sae 
a b? 


/ a2 


Accordingly, we are required to optimize a + by/1— a by taking its derivative with respect to a and 
setting it to zero to obtain the optimization condition, that is: 


d a? 
ae |: T b 1 | = 0 
ppt): es 
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Y 


a 


Figure 38: The setting of Problem 7 of § 3.3 where a rectangle is inscribed inside an ellipse (with semi- 
major axis a and semi-minor axis b) centered on the origin of coordinates. The major axis of the ellipse 
is on the x axis (and hence the sides of the rectangle are parallel to the coordinate axes), and a and 8 
are the x and y coordinates of the rectangle vertex in the first quadrant. 


jo = 0 
a? 1-% 
ba 
: a4 — ata? on 
ba? 
ae an 1 
bar+aza? = at 
2 at 
BO Gas 
52 
a Jer 


Therefore: 


a? 1 a4 a? b? 
=b/1- — =d,/1- = ( ——__) =d,/1 = 
P a i a (= + F) a+b Jar +b? 


Hence, the rectangle of optimal perimeter has length LD = 2a = ae and width W = 26 = ee 
So, its perimeter is: 


2 2 2 1. A2 


=20+2W = + 
P Va2 + b2 Va2 + b2 az + b2 


Note 1: it is obvious that the optimal perimeter in this Problem is a maximum (not a minimum) 
because the perimeter can converge to 4b when a approaches zero and it can converge to 4a when 8 
approaches zero (noting that 4b < 4Va? + 6? and 4a < 4Va? + 67). 


Note 2: if we set a = b = R (i.e. the ellipse becomes a circle of radius R) then we obtain a = 6 = He 
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i.e. we retrieve (as a special case) the result of the circle that we obtained in Problem 5 (noting that 
in Problem 5 we used unit circle). 

8. What is the shape of the rectangle of optimal area inscribed inside an ellipse? Also, find the optimal 
area. 
Answer: If we follow the setting and reasoning of Problem 7 then the area a of the rectangle is: 


2 Ab 
o = 4aB = 4ab\/1- = = — a2a?2 — a4 
a a 


Accordingly, we are required to optimize Va?a? — a* by taking its derivative with respect to a and 
setting it to zero to obtain the optimization condition, that is: 


d 
rr (v a2a? — a!) = 0 
fay 
2a7a — 4a? 
= 0 
2/a2a2 — at 


Qa 
Q 
| 
bo 
Q 
we 
oO 


Therefore: 


Hence, the rectangle of optimal area has length L = 2a = 2a and width W = 26 = V2b. So, its area 
is: 


o = LW = V2av2b = 2ab 


Note 1: it is obvious that the area in this Problem is a maximum (not a minimum) because the area 

can converge to zero when a or ( approaches zero. 

Note 2: if we set a = b = R (i.e. the ellipse becomes a circle of radius R) then we obtain a = 8 = 5, 

i.e. we retrieve (as a special case) the result of the circle that we obtained in Problem 6 (noting that 
in Problem 6 we used unit circle). 
Note 3: comparing the result of this Problem to the result of Problem 7 we see that the optimal 
rectangles in these Problems are different. This is unlike the Corresponding Problems of rectangles 
inscribed in circles (see Problems 5 and 6) where the two “rectangles” are the same because they are 
actually squares due to the circular symmetry. 

9. What is the shape of the triangle of optimal area whose two vertices are on the foci of an ellipse while 
the other vertex is on the perimeter of the ellipse? 

Answer: According to the definition of ellipse,!®9! the perimeter of this triangle is fixed and hence 
this Problem is an instance of Problem 5 of § 3.2 since the perimeter and one of the sides (i.e. the 
side that connects the two foci) of the triangle are fixed. Therefore, the triangle is an isosceles and its 
optimal area is a maximum (according to note 1 of Problem 5 of § 3.2). 

Note 1: the optimal area in this Problem is o = bc = bya? — b? where a is the semi-major axis of the 
ellipse, b is the semi-minor axis and c is the distance between the center and a focus. 

Note 2: the result of the present Problem can be easily obtained from the fact that the area of triangle 
is half its base times its height plus the fact that in our case the base is fixed while the height takes its 


[69] As it should be known from elementary geometry, an ellipse is a plane curve with the property that the sum of the 
distances between any point on the curve and two fixed points (i.e. its foci) in the plane is constant (which is equal to 
2a with a being the semi-major axis of the ellipse). 
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optimal (maximum) value when the two sides are equal (i.e. when the moving vertex of the triangle is 
at the co-vertex of the ellipse) .!7°! 
10. What is the shape of the triangle of optimal perimeter whose two vertices are on the vertices of an 
ellipse!74| while the other vertex is on the perimeter of the ellipse? 
Answer: Referring to Figure 39, the side of the triangle on the major axis is fixed and hence we 
only need to optimize the sum S' of the other two sides whose lengths are \/h? + (a+ X)? and 
h? + (a— X)? (where 0<h<band0< X <a), that is: 


Y 


a 


Figure 39: The setting of Problem 10 of § 3.3 where a triangle is inscribed inside an ellipse (with semi- 
major axis a and semi-minor axis b) centered on the origin of coordinates. The major axis of the ellipse 
(which is on the x axis) forms one side of the triangle while the opposite vertex of the triangle is on the 
ellipse perimeter. 


S=VJSh? + (a+ X)2 4+ Vk? + (a— X)? 


Now, to optimize S we take its partial derivatives with respect to h and X and set them to zero to 
obtain the optimization conditions, that is: 


Os 
Dh = 0 
h ‘ h a i 
Vh2 + (a+X)2 Vk? + (a- XxX)? 
1 1 
h es ee i 
JVh2 +(at+X)? Sh? + (a—X)? 
h = O (86) 
AND 
Os 
a © 


[70] We mean by “co-vertex of the ellipse” the end point of the minor axis. 
[71] We mean by “vertices of an ellipse” the end points of the major axis. 
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11. 


(a+ X) —(a—X) 


Sh? + (a+ X)? i Jh2+(a—X)2 ° 
(+X? XP 
h?+(a+X) h? + (a—X)? 

(a+X)? [h? + (a—X)?] = (a—X)? [h?+(a+X)’] 
WlatxX)+(at+xXV(a—-X? = Wla-XP+(at+X\(e-XP 
Wiat+tXY = h(a-—X) 

(a+X) = (a—X)? (h #0) 
a? +2aX+X7 = a? —2aX + X? 
2aX = —2aX 
x = —X (a £0) 
Xx = 0 (87) 


As we see, Eq. 86 is unacceptable because 0 < h < 6 (in fact h = 0 corresponds to the lower limit of 
the perimeter). However, Eq. 87 is acceptable (since 0 < X < a) and hence the triangle is an isosceles 
(with h = b). Accordingly, the triangle in this Problem is an isosceles with an optimal perimeter of 
2a + 2Va? + b?. 

Note 1: if the fixed side in this Problem is on the minor axis then the result is similar, i.e. the triangle 
is an isosceles with an optimal perimeter of 2b + 2a? + b?. 

Note 2: it is obvious that the optimal perimeter in this Problem is a maximum (not a minimum) 
because the perimeter can converge to 4a (or 4b) when the height h of the triangle approaches zero. 
Note 3: the circle is a special case of ellipse and hence the result of this Problem also applies to circles, 
i.e. the triangle of optimal perimeter whose one side is a diameter of a circle while the opposite vertex 
is on the circumference of the circle is an isosceles of optimal (ie. maximum) perimeter of 2R+2/2R 
(with R being the radius of the circle). 

What is the shape of the triangle of optimal area whose two vertices are on the vertices of an ellipse 
while the other vertex is on the perimeter of the ellipse? 

Answer: Referring to the setting of Problem 10 and Figure 39, the area o of the triangle is half the 
base (which is 2a) times the height (which is h) and hence o = ah. Now, a is constant and hence the 
optimal of o occurs when h is maximum (noting that o 4 0 and h 4 0). Hence, from the condition 
0 <h <b we conclude that the optimal area is when h = b. Accordingly, the triangle in this Problem 
is an isosceles with an optimal (i.e. maximum) area of ab. 

Note 1: if the fixed side in this Problem is on the minor axis then the result is similar, i.e. the triangle 
is an isosceles with an optimal (i.e. maximum) area of ab. 

Note 2: the circle is a special case of ellipse and hence the result of this Problem also applies to circles, 
i.e. the triangle of optimal area whose one side is a diameter of a circle while the opposite vertex is on 
the circumference of the circle is an isosceles of optimal (i.e. maximum) area of R? (with R being the 
radius of the circle). 


3.4 Surface of Revolution of Optimal Area 


In this type of problems it is required to optimize the area of a surface of revolution!”?! generated by 
rotating a planar curve [ represented by the function y = y(a) with fixed end points A(ax,4,y,a) and 
B (ap, yp) around the « axis (see Figure 40).!73! So, the objective is to find the form of the planar curve 
that produces such a surface of revolution of optimal area. It should be noted that although the immediate 
objective here is to find the shape of a curve, the ultimate objective is to find the shape of a surface and 
hence the subject is classified as an optimal surface issue. This is due to the fact that we are dealing with 
a surface of revolution which is completely determined by its profile curve. 


[72] 


“Surface of revolution” is a surface generated by revolving a plane curve around a straight line in its plane. 


[73] The representation of I by the function y = y (zx) is the common case. Other representations are also possible. 
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Figure 40: A simple sketch depicting the setting of Problem 1 of § 3.4 where a surface of revolution is 
generated by revolving a planar curve [ around the x axis. The curve IT is represented by a function 
y(x) and it connects two fixed points (A and B) with ds representing the length of an infinitesimal arc 
generating an infinitesimal ring of radius y and width ds. 


Problems 


1. Find the shape of the surface of revolution of optimal area (as described above in the text). 
Answer: The area o of a surface of revolution is the sum of the areas of the infinitesimal rings of 
radius y and width ds (see Figure 40). Noting that the area of each one of these rings (which are 
infinitesimal cylinders) is given by 27yds (i.e. perimeter 27y times width ds), the area o should be 
given by the following integral (which is the functional that we intend to minimize in this Problem): 


vB 
a= | 2nyds=2n [ yV1+y? dz = I[y| 
Tr LA 


Accordingly, F (x,y, y’) =y/1+y?. As we see, F' in this Problem is identical to F in the problem of 
catenary (see Problem 1 of § 2.3). Therefore, the shape of the curve I (which represents the profile of 
the required surface of revolution) should also be a catenary (i.e. hyperbolic cosine), that is: 


wate (=) (88) 


A surface generated by the revolution of a catenary (around its horizontal axis) is called catenoid, and 
hence the surface of revolution of optimal area is a catenoid. 

Note 1: the constants C' and D in Eq. 88 can be determined from the two boundary conditions at 
the end points of the curve (see Problem 2). 

Note 2: the optimal solution of this Problem should be a minimum (not a maximum) because it is 
obvious that the surface area generated by the revolution of some curves can diverge and hence the 
optimal solution cannot be a maximum.!"4l Also see Problem 4. 

Note 3: the existence and uniqueness of the solution of this Problem is not guaranteed. In more 


[74] Although this sort of arguments essentially rules out the possibility of global (or absolute) maximum, it should work in 
general even for local maximum (noting also our lax approach in pursuing issues like determining the nature of optimum 
and considering the upcoming Problem 4 as well although it is also based ultimately on a similar argument). 
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accurate terms, depending on the positions of the fixed end points (A and B) there may be one 
solution of this type (which is usually the required solution), or more than one solution (with only one 
possibly representing the required minimal surface) or there is no solution at all. 75] 

2. Given that the profile curve of the surface of Problem 1 passes through the boundary points (0,1) and 
(0.5, 2), find the equation of this profile curve and plot it. 
Answer: We start by solving Eq. 88 for D, that is: 


y 
D =x -— Carccosh (4) 
C 
On substituting the coordinates of the two boundary points into this equation we get: 


D 


I 


1 
0-C h (| = 89 
arccos ( z) (89) 


D 


I 


0.5 — C'arccosh (z) (90) 


On subtracting Eq. 89 from Eq. 90 we get: 


0.5 — Carccosh (=) + Carccosh (z) =0 


On solving this equation for C' (using a numerical solver) we get C ~ 0.6348247523 and hence D ~ 

—0.6518922526 (where we use Eq. 89 or Eq. 90 to find D). Accordingly, the equation of the profile 

curve is: 

x + 0.6518922526 
0.6348247523 


y & 0.6348247523 cosh ( ) ~ 0.6348247523 cosh (1.575237885a + 1.026885373) 


The profile curve is plotted in Figure 41.!76 

3. Confirm the result of Problem 1 using cylindrical coordinates. 
Answer: A surface of revolution (with a profile curve I’) whose axis of symmetry is the z axis can be 
given in cylindrical coordinates as p = p(z). Hence, its surface area is: 


c= i. 2apds = an f pv (dp)” + (dz)? = an f pvp? +1dz= I[p] 
T be Zz 


where p’ = dp/dz. Accordingly, F (z, p, p’) = pvp’? + 1 and hence the Euler-Lagrange equation is (see 
Eq. 3 noting the correspondence between z, p,p’ here and x,y, y’ there and the absence of z here): 


re) 
pA Fag L p+ = C 


12 
p (pe ar Re = ¢ 


/ p? +1 
12 12 
p(e?+1)-— pp? = Crp? +1 
p Cyn? +1 
2 
; P a 
. ( eg 
[75] Tn fact, there are many details here about the nature of the solution and if it exists or not. However, we do not go through 
these details since our primary objective in this book is the investigation of the techniques of variational calculus. The 
interested reader should refer to the literature (see for example Weinstock in the References). 


[76] As in footnote [75] , in this Problem and its alike we are only interested in investigating and demonstrating the techniques 
(rather than the nature of the solution and its physical significance as we have no primary interest in these details). 


I 


I 
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0 0.1 0.2 0.3 0.4 0.5 
X 


Figure 41: Plot of the profile curve of the minimum surface of revolution that passes through the boundary 
points (0,1) and (0.5, 2). See Problem 2 of § 3.4. 


ih oe 
(6)°-1 
- La 
2 = Carecosh (4) + D (91) 
z—-D 
p = Cosh ( CG ) 


Now, if we note the above-indicated correspondence between z, p, p’ and x,y, y’ then the last equation 
is the same as Eq. 88, as required. 

4. Verify the fact that catenoid is a minimal surface using the result of Problem 4 of § 1.6. 
Answer: We use the settings of Problem 3 which satisfy the domain and boundary conditions of 
Problem 4 of § 1.6.77] Hence, according to Eq. 91 (which is an equation of a catenoid in 3D as well 
as a catenary in 2D) we have: 


[x2 ap 
z = Carccosh (4) + D (p = Va? + y?) 


C 
x 
zy => 
a 2 
fa? + y? aa —1 
—ot + yt — Oy? 
Zea 


3/2 
C2 (x2 + y2)3/? (S34 = 1) 


(771 Tn fact, there are some issues about the domain and boundary conditions that require discussion and clarification. 
However, we ignore these issues because they are not relevant or necessary for our objectives. 
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y 
[72 +y? ere ee | 
—y* as 4 = C272 
Zyy = 3/2 
C2 (x2 + y2)3/? (= = 1) 
—xy Qe + Qy? — C7) 
C2 (a? & yee? (a = rn 


Zry = 


On substituting from these equations into Eq. 17 we get 0 = 0, i.e. Eq. 17 is satisfied identically by 
the equation of the catenoid. So, the catenoid is a solution to Eq. 17 and hence its area is minimum 
(according to Problem 4 of § 1.6), ie. it is a minimal surface, as required. 


Chapter 4 
Optimal Solids 


In this chapter we present and solve problems about topics and applications of the mathematics of variation 
related to solids, i.e. we are looking in these problems to certain 3D objects that optimize something (such 
as volume). 


4.1 3D Shapes of Optimal Sides Lengths 


From the title of this section, it is fairly obvious that in this type of problems it is required to optimize 
the sum of the lengths of the sides of a given 3D shape!” (such as rectangular parallelepiped) subject to 
certain condition(s) such as being of fixed surface area. We note that due to the simplicity of this type of 
problems we use only ordinary calculus (possibly in association with the Lagrange multipliers technique) 
in solving these problems and hence no variational formulation (based on the Euler-Lagrange equation) 
is required. 

Problems 


1. What is the shape of the rectangular parallelepiped of fixed surface area that optimizes the sum of the 
lengths of all its sides? 
Answer: This is a length optimization problem with an area constraint because we are required to 
optimize the sum of sides lengths subject to the surface area constraint. So, we can solve it by using 
the Lagrange multipliers technique (see § 1.8). 
If the three dimensions of the parallelepiped are x, y, z then its sum of sides lengths is 4(a + y+ z) and 
its surface area is 2(ay+ «z+ yz). So, our Lagrange multipliers formulation is f + Ag = 4(~@+y+2)+ 


A2(xy + xz + yz) and hence we should optimize f + Ag by taking its partial derivatives with respect 
to x,y,z and setting them to zero to obtain the optimization conditions, that is: 
) 
Da [4(a+yt+2)+A%M(ayt+az+yz)]) = 44+A2y+z)=0 (92) 
3) 
Dy [4(a+y+z)+A2(aytauz+yz)] = 44+A2(ae+z) =0 (93) 
6) 
De [4(a+yt+2)+A%M(ayt+az+yz)]) = 44+A2(x+y) =0 (94) 


On subtracting Eq. 93 from Eq. 92 we get « = y while on subtracting Eq. 94 from Eq. 93 we get 
y = z. Hence, x = y = z which means that our parallelepiped is a cube. 
Note: it should be obvious that the optimal sum of sides lengths in this Problem is a minimum (not 
a maximum) because this sum will diverge if two of the dimensions (say y and z) of the parallelepiped 
approach zero (noting that the surface area is fixed). 

2. What is the shape of the rectangular parallelepiped of fixed volume that optimizes the sum of the 
lengths of all its sides? 
Answer: This is a length optimization problem with a volume constraint. So, we can solve it by 
the Lagrange multipliers technique (see § 1.8) as we did in Problem 1 (see the upcoming note 1). 
However, for diversity and comparison we solve it using a different method, i.e. by embedding the 
volume constraint within the expression of the optimized quantity (which is the sum of sides lengths). 
According to the volume constraint we have V = xyz (where x,y,z are the three dimensions of the 


[78] More technically and specifically, the 3D shapes of interest in this section have straight sides and hence they are polyhe- 
drons. So, in these problems we are optimizing the sum of the lengths of the sides of polyhedrons. 
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parallelepiped and V is the volume which is a constant) and hence z = ia So, the sum of sides lengths 
is S =4(a+y+z)=4 (x +yt+ v). Now, we should optimize S' by taking its partial derivatives with 
respect to x and y and setting them to zero to obtain the optimization conditions, that is: 


10S 
Ls Sane ee 
4 Ox uy 
10S _ a 8 0 
Ady xy? 
On subtracting the first equation from the second we get: 
V V 
if ae ee Y 
Ty xy 
aty = ay? 
c= y (dividing by xy 4 0) 
Now, if we repeat the above process but this time with y = a (instead of z = x) then we get x1 = z. 


Hence, « = y = z which means that our parallelepiped is a cube. 
Note 1: if we use the Lagrange multipliers technique then f + Ag = 4(a + y + z) + Awyz and hence: 


oF +Ag) = 4+dAyz=0 
Ox 
Pe Piexa) = 4+ Arz =0 
Oy 
6) 
ay (ftAg) = 4+Ary=0 
Oz 


which by comparison lead to x = y = z. 
Note 2: it should be obvious that the optimal sum of sides lengths in this Problem is a minimum (not 
a maximum) because this sum will diverge if two of the dimensions (say y and z) of the parallelepiped 
approach zero (noting that the volume is fixed).!79l 
3. What is the shape of the regular pyramid of square base and fixed volume that optimizes the sum of 
the lengths of all its sides?!®° 
Answer: Let a be the length of the 4 sides of the square base, A the length of the 4 slant sides, V 
the fixed volume of the pyramid, H its height, and h the height of the slant faces (see Figure 42). 
Accordingly, we have: 
1, 3V 
V==a°H and hence HA=—~ 
3 a? 
Now, the sum S of the lengths of all sides is given by: 
S = 4a+4A 


= 4a+4 


a2 BV 
= 4qa +4 —-+ ee 
onal + (Fr) 


79] Tn fact, it will diverge even if one dimension approaches zero. 
[80] “Regular” means that all the 4 slant sides of the pyramid are identical. 
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Figure 42: A schematic illustration of the setting of Problem 3 of § 4.1 showing a regular pyramid of 
square base and fixed volume. The symbols are explained in the text. 


To optimize S we take its derivative with respect to a and set it to zero to obtain the optimization 


condition, that is: 


1dS 
4 da 

a — 422 

es a 
a2 2 
Dei aa 

14 08 = 36V? 
(a® — 36V?)" 


a = 72 36-7 
a? 108Va° = 36°" 


On solving the last equation for the variable a° (using the quadratic formula), we get: 


6 -108V? + V108°V4+4 x 36°V4 
ee = ( 544 


2 


: 18vi3) Vv? 


The only physically!!! sensible root is the positive and hence a® = (—54 + 18/13) V? and thus a = 


[81] In fact, even mathematically considering our restriction to real quantities. 
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(—54 + 18/13) ie V‘/3_ So, our pyramid has 4 base sides each of length a = (—54 + isVI3) "° Vos 


and 4 slant sides each of length A = 4/ ee + ove (with a being given by the previous expression). 
Note: it should be obvious that the optimal sum of sides lengths in this Problem is a minimum (not 
a maximum) because this sum will diverge if the height H of the pyramid approaches zero or if the 
base shrinks (noting that the volume is fixed). 


4.2 3D Shapes of Optimal Surface Area 


From the title of this section, it is obvious that in this type of problems it is required to optimize the 
surface area of a given 3D shape (such as pyramid or cone) subject to certain condition(s) such as being 
of fixed volume. As in the Problems of § 4.1, we use only ordinary calculus in solving these problems 
without need for employing the variational formulation of the Euler-Lagrange equation. 
Problems 
1. What is the shape of the regular pyramid of square base and fixed volume that optimizes the surface 
area? 
Answer: If we follow the setting of Problem 3 of § 4.1 (including Figure 42) then the surface area o 
of the pyramid is the sum of the base area which is a? plus 4 triangular slant faces each of area sah. 
Hence, we have: 


To optimize o we take its derivative with respect to a and set it to zero to obtain the optimization 
condition, that is: 


dao 
— 0 
3 gov? 
2a +2 = = = 0 
aft + Oe 
6_y 2 
ona a 8V _ 0 
aa a + 
(a® — 18V?)” sos 
=> a 
a (4 + 2) 
al? — 36V7a5 +187V4 = a)? 4+36V2a% 
72V7a® = 18°V4 
2 
o. © 
2 
9 1/6 
a= (5) i" (95) 


So, our pyramid has a square base of side length a and height H = ay with an optimal surface area 


og =a? +2y/ ot + ov (where a is given by Eq. 95). 
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Note: it should be obvious that the optimal surface area in this Problem is a minimum (not a 
maximum) because the surface area will diverge if the height H of the pyramid approaches zero 
(noting that the volume is fixed). 
2. What is the shape of the right circular cone of fixed volume and optimal surface area? 

Answer: This is an area optimization problem with a volume constraint, so we solve it by the Lagrange 
multipliers technique (see § 1.8). If the radius of the cone base is R and its height is H then its surface 
area is 0 = tR? + tRVH? + R? while its volume is V = $R?H. So, what we should optimize 
(according to the Lagrange multipliers technique) is: 


f+ \g=o+W =0R? + nRVHP +R + RH 


Accordingly, we take the partial derivatives of o + AV with respect to R and H (which are the 
variables in this optimization since they control the area and volume) and set them to zero to obtain 
the optimization conditions, that is: 


10 ti 2X 
—-~=—(0+XV) = 2R4+~VH?74+ R?4+ ——— 4 RHA = 
zOR”? ) JH? + R? 3 

10 RH A 

Sao )) eS ee eR 0 

OHS ) «/ H? + R28 


Now, if we multiply the second equation by at and subtract the result from the first equation we get: 


R? 2H? 
| 24 2 on 
2R+~V/H7+ R*4 Jip = 0 


+ R2 JV H?2 + R? 
‘ie H?+R? | R? 2H? Sie 
: JH? + R2 ' JH? + R2 JH? + R2 > 
2 2_ Ff? 
i ee = 0 
H2 + R2 
(2R? — H?)? 
— CO AR? 
H? + 
4R*—4R?H*?+H* = 4R?H?+4R4 
H*—8R?H? = 0 
H?—8R? = 0 (H #0) 
H = V8R 


1/3 
So, the optimal cone has a base radius R = (2%) and height H = /8R with an optimal surface 


area o = 4 R?. 
Note: the optimal surface area in this Problem is a minimum because o can diverge when H approaches 
zero (noting that the volume is fixed). 

3. What is the shape of the rectangular parallelepiped of fixed volume and optimal surface area? 
Answer: This is an area optimization problem with a volume constraint and hence we can solve it 
using the Lagrange multipliers technique (see § 1.8). Now, if the three dimensions of the parallelepiped 
are x,y,z then its surface area is 0 = 2(xy + xz + yz) and its volume is V = xyz. So, our Lagrange 
multipliers formulation is f + A\g =o + AV = 2(xy + xz + yz) + Axyz and hence we should optimize 
a0 +XV by taking its partial derivatives with respect to x,y,z and setting these partial derivatives to 
zero to obtain the optimization conditions, that is: 


2 (c+AV) = Ay+z) +rAyz =0 (96) 
ao (o+AV) = 2ax+z)+Arz=0 (97) 
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2 (+ V) = AWe+y)+ Acy=0 (98) 


Now, if we multiply Eq. 96 with x and Eq. 97 with y and subtract the second from the first we get: 


x[Q(yt+z)+Ayz]—y[2(ea+2z)+Arz] = 0 
2a(y+z)-—2y(a+z) = 0 
2ez—2yz = O 
x-y = 0 (z £0) 
e= y 


Similarly, if we multiply Eq. 97 with y and Eq. 98 with z and subtract the second from the first we 
get y =z. 
On combining the results « = y and y = z we get « = y = z which means that our parallelepiped is a 
cube. 
Note: the optimal surface area in this Problem is a minimum because o can diverge when one 
dimension (say z) approaches zero (noting that the volume is fixed). 

4. What is the shape of the rectangular parallelepiped of fixed sum of sides lengths and optimal surface 
area? 
Answer: This Problem can be solved by a similar method to that used in Problem 1 of § 4.1. However, 
it is easier to use the proof by contradiction to show that the shape is a cube because if the cube with 
fixed sum of sides lengths (say So) is not optimal (i.e. maximum; see the upcoming note) in surface 
area then there should be a non-cube parallelepiped with sum Sp and with maximum surface area (say 
d9) and hence a cube of surface area og will have a larger sum than So which contradicts the result of 
Problem 1 of § 4.1 because the cube of surface area 9 has minimum sum of sides lengths. |®?! 
Note: the optimal surface area in this Problem is a maximum because the surface area will converge 
to zero if two dimensions (say y and z) of the parallelepiped approach zero (noting that the sum of 
sides lengths is fixed). 

5. What is the shape of the right circular cylinder of fixed volume and optimal surface area? 
Answer: The surface area of right circular cylinder is ¢o = 27 R? +27RH and its volume is V = 7R?H 
where R and H are its radius and height. We are asked to optimize o subject to a constraint on V and 
hence according to the Lagrange multipliers technique we have f + \g =o + AV = 27 R? + 20 RH + 
\nR?H. So, we take the partial derivatives of ¢o + \V with respect to R and H and set them to zero 
to obtain the optimization conditions, that is: 


10 
—~~—(0+X»V) = 4R+2H+2\RH =0 
TOR 
Dee Ged any) = 2R+AR?=0 
a OH 7 7 
Now, if we multiply the second equation with 2 and subtract it from the first equation we get: 
4R+2H—-4H = 0 
H = 2R 


So, the optimal cylinder has a height equal to its diameter with an optimal surface area o = 27.R? + 


Qn R (2R) = 6rR? where R= (¥)"”. 


Note: the optimal surface area in this Problem is a minimum because o will diverge when H approaches 
zero (noting that the volume is fixed). 


[82] For cube, S = 12x and o = 6x? (where S is the sum of sides lengths, o is its surface area, and x is the length of its sides) 
which can be combined to obtain S = 12,/a/6 and hence the sum of sides lengths increases/decreases as the surface 
area increases/decreases (and vice versa). 
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4.3. 3D Shapes of Optimal Volume 


From the title of this section, it is clear that in this type of problems it is required to optimize the volume 
of a given 3D shape (such as right circular cylinder) subject to certain condition(s) such as being of 
fixed surface area. Again, we use only ordinary calculus in solving these problems with no need for the 
Euler-Lagrange equation. 


Problems 


He 


What is the shape of the rectangular parallelepiped of fixed surface area and optimal volume? 
Answer: This is a volume optimization problem with an area constraint and hence it can be solved by 
the method of Problem 3 of § 4.2 where in this case we have f+Ag =V+A0 = xyz +A2(ayt+az+yz) 
and hence by differentiation and elimination (as we did in Problem 3 of § 4.2) we again obtain x = y = z 
which means that our parallelepiped is a cube. 

Note 1: the optimal volume in this Problem is a maximum because V will converge to zero when one 
dimension (say z) approaches zero (noting that the surface area is fixed). 

Note 2: the result of this Problem can be obtained as a corollary of the result of Problem 3 of § 4.2 
(using the proof by contradiction method) because if the parallelepiped of the given surface area (say 
oo) and maximum volume (say Vo) was not cube then the cube of the surface area oo should have 
smaller volume than Vp and hence the cube of volume Vo should have larger surface area than oo and 
this contradicts the result of Problem 3 of § 4.2 because the cube of volume Vo should have minimum 
surface area (in comparison to any other rectangular parallelepiped of the same volume).|831 
Similarly, the result of Problem 3 of § 4.2 can be obtained as a corollary of the result of the present 
Problem (using the proof by contradiction method) because if the parallelepiped of the given volume 
(say Vo) and minimum surface area (say go) was not cube then the cube of the volume Vo should have 
larger surface area than oo and hence the cube of surface area o9 should have smaller volume than Vo 
and this contradicts the result of the present Problem because the cube of surface area oo should have 
maximum volume (in comparison to any other parallelepiped of the same surface area), [84] 


. Find the volume of the rectangular parallelepiped of maximum volume with a surface area of 24 (area 


units). 

Answer: From the result of Problem 1, the parallelepiped should be a cube and hence from the 
formula of the surface area o of cube we have o = 6x? = 24 (where z is the length of its sides) which 
leads to x = 2. Now, from the formula of the volume V of cube we have V = x? = 8 (volume units). 


. What is the shape of the rectangular parallelepiped of fixed sum of sides lengths and optimal volume? 


Answer: This Problem can be solved by a similar method to that used in Problem 2 of § 4.1. However, 
it is easier to use the proof by contradiction to show that the shape is a cube because if the cube with 
fixed sum of sides lengths (say So) is not optimal (i.e. maximum; see the upcoming note) in volume 
then there should be a non-cube parallelepiped with sum Spo and with maximum volume (say Vo) and 
hence a cube of volume Vo will have a larger sum than Sp which contradicts the result of Problem 2 of 
§ 4.1 because the cube of volume Vo has minimum sum of sides lengths.|*°! 

Note: the optimal volume in this Problem is a maximum because for a parallelepiped with fixed sum 
of sides lengths if two dimensions (say y and z) approach zero the volume converges to zero.|°4 


. What is the shape of the right circular cone of fixed surface area and optimal volume? 


83 


84 
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For cube, o = 6x? and V = 2° (where o is its surface area, V is its volume, and z is the length of its sides) which can 
be combined to obtain o = 6V?/3 and hence the surface area increases /decreases as the volume increases/decreases (and 
vice versa). 

For cube, V = x? and o = 6x? (where V is its volume, c is its surface area, and x is the length of its sides) which can 
be combined to obtain V = (o/6)3/? and hence the volume increases/decreases as the surface area increases/decreases 
(and vice versa). 

For cube, S = 12a and V = 2° (where S is the sum of sides lengths, V is its volume, and z is the length of its sides) 
which can be combined to obtain S = 12V1/3 and hence the sum of sides lengths increases/decreases as the volume 
increases/decreases (and vice versa). 

In fact, it will converge to zero even if only one dimension approaches zero. 
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Answer: If we follow in our footsteps in Problem 2 of § 4.2 then we have: 


ftrd\g=Vtd0= ReH 4+ \wR? + \w RH? + R? 


Hence: 
1 0 2 \R? 
carey rae = =RH+2R+\/ 1? ++ __e= 
— 3 (V +0) RH +2\R+ + FP + ae = 0 
1a 1... ARH 
~ on VY +) = 3h + ae 


Now, if we multiply the second equation by 2H and subtract the result from the first equation we get: 


AR? 2\H? 
QAR + AV H?2 + R24 = 0 
V H2 + R2 V H2 + R2 
R? 2H? 
2R+/H? + R? => ——— ee (A 40) 
VH2+R VH2+R 
H? + R? R? 2H? 
2R+ + = 0 
VH?2+R2 JH24+R2 VH24+ R? 
2R? — H? 
2R + ——— = 0 
VHA + RB? 
H = V8R 


which is the same as the result of Problem 2 of § 4.2 (so the optimal cone has a base radius R = \/# and 


height H = /8R with an optimal volume V = mlS 3), This should be the case to avoid contradiction 
because if a cone optimizes (i.e. minimizes) the surface area for a fixed volume then it should optimize 
(i.e. maximize) the volume for a fixed surface area. In fact, we could have established the result of 
this Problem using the proof by contradiction method in conjunction with the result of Problem 2 of 
§ 4.2, but we followed this method for practice and confirmation. 
Note: the optimal volume in this Problem is a maximum because V will converge to zero when H 
converges to zero (noting that the surface area is fixed). 

5. What is the shape of the right circular cylinder of fixed surface area and optimal volume? 
Answer: We can use the proof by contradiction in conjunction with the result of Problem 5 of § 4.2 
to show that the optimal cylinder has a height equal to its diameter. However, for more practice we 
use the Lagrange multipliers technique again with f + Ag = V + \o = 7 R?H + 27AR? + 27\RH. So, 
we take the partial derivatives of V + Ao with respect to R and H and set them to zero to obtain the 
optimization conditions, that is: 


1 

120 (yp joy =! SRP RE =o 
TOR 

1 a ; 


Now, if we multiply the second equation with at and subtract the result from the first equation we 
get: 
4\R + 2\H — 4\H 0 
H = 2R 


So, the optimal cylinder has a height equal to its diameter with an optimal volume V = 7R? (2R) = 
2nR? where R = /Z. 
Note: the optimal volume in this Problem is a maximum because V can approach zero when H 
approaches zero (noting that the surface area is fixed). 
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6. What is the shape of the ellipsoid of fixed sum of semi-axes lengths and optimal volume? 
Answer: The sum of the lengths of semi-axes a, b,c is S = a+b+c while the volume is V = 4 mabe. It 
is required to optimize V subject to a constraint on S and hence we can use the Lagrange multipliers 


technique with f +Ag =V +AS = 4nabe+ A(a +b +c). On taking the partial derivatives of V + AS 


3 
with respect to a,b,c and setting them to zero we can obtain the optimization conditions, that is: 

O 4 

—— —_ — _ b = 

3a (V + AS) g mee +A=0 

O 4 

ap VV + AS) = gractA=0 

O 4 

— —_ — _ b = 

De (V + AS) gma +A =0 


By subtracting the second equation from the first equation we get a = b while by subtracting the third 
equation from the first equation we get a = c. Hence, a = b = c which means that the optimal ellipsoid 
is a sphere. 
Note: the optimal volume in this Problem is a maximum because V can approach zero when one (or 
two) of the semi-axes approach zero (noting that the sum of semi-axes lengths is fixed). 

7. What is the shape of the right circular cone of fixed slant height (or generator) and optimal volume? 
Answer: Referring to Figure 43, if the fixed slant height is H while the height and base radius are h 
and r then r = / H? — h? and hence the volume of the cone is: 


A 


r ‘i 


Figure 43: A cross section of a right circular cone of base radius r and height A with fixed slant height H. 
The cross section is through the apex A of the cone and the diameter of its base. See Problem 7 of § 4.3. 


shes feet igs ie i 
Varia ant W)h= 30 (Hh h°) 


To optimize V we take its derivative with respect to h (which is the variable since H is fixed) and set 
the derivative to zero to obtain the optimization condition, that is: 


3 dV 
at dh 
H?—3h? = 0 


4.4 3D Shapes Inside Other 3D Shapes 141 


h= — (h > 0) 


Hence, the optimal cone has h = a and r = v A? — He = J 2H with an optimal volume V = 
1 (2772) (HL) = 20_ 773 

gm (3H) (4) gee 

Note: the optimal volume in this Problem is a maximum because V can converge to zero when r 
approaches zero or when h approaches zero (noting that the slant height is fixed). 


4.4 3D Shapes Inside Other 3D Shapes 


In this type of problems it is required to optimize a certain property (such as volume) of a given 3D shape 
(such as rectangular parallelepiped) that is confined inside another 3D shape (such as ellipsoid). Again, 
no variational formulation (based on the Euler-Lagrange equation) is required because these problems are 
sufficiently simple to be solved by ordinary calculus. 


Problems 


1. What is the shape of the rectangular parallelepiped of optimal volume that is inscribed inside an 
ellipsoid of a fixed shape (noting that all the sides of the parallelepiped are parallel to the axes of the 
ellipsoid)? 

Answer: Using a rectangular Cartesian coordinate system, an ellipsoid in standard form (i.e. it is 
centered on the origin of coordinates and its three axes are aligned along the axes of the coordinate 
system) is given by the following equation: 


tet eel (99) 


where a,b,c are the lengths of the semi-axes (which are positive constants). Now, by symmetry!®7! the 
volume of the parallelepiped in each octant is equal to the volume in any other octant and hence if we 
consider the part of the parallelepiped that is in the first octant (i.e. the octant with x,y,z > 0) then 
the volume of this part is xyz (with x,y,z being the coordinates of the parallelepiped vertex in that 
octant) and hence the volume of the parallelepiped is 8xyz (since in 3D we have 8 octants)./88! So, 
our objective is to optimize f = xyz (which is equivalent to optimizing 8xyz). However, because the 
parallelepiped is inscribed inside the ellipsoid then its vertices should be on the ellipsoid and hence the 
x,y,z of the vertex in the first octant should satisfy the equation of the ellipsoid (i.e. Eq. 99). So in 
brief, we have to optimize xyz subject to the constraint of Eq. 99 and hence we can use the Lagrange 


multipliers technique by optimizing h = f+Ag = axyz+A (= + ve + =). Accordingly, we take the 
partial derivatives of h with respect to x, y, z (which are the variables of optimization since they control 
the volume of the parallelepiped) and set them to zero to obtain the optimization conditions, that is: 


Oh 20% 
ag oe ae 0 (100) 
Ooh 2dy 
ae tz+ a = 0 (101) 
Oh 2rz 
Gas re ae we) 


[871 The symmetry of the ellipsoid (as a result of being in standard form) should imply the symmetry of the parallelepiped, 
i.e. it is in standard form with its center being on the origin of coordinates and its sides being parallel to the axes of the 
coordinate system. 

[88] The reader should note that we are actually using z, y, z in two different meanings, i.e. as general coordinates (as in Eq. 
99) and as coordinates of the parallelepiped vertex in the first octant. The reason is to simplify the notation and avoid 
unnecessary complications; otherwise we can (for instance) use X,Y,Z for the coordinates of the vertex to distinguish 
between the two meanings. 
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On multiplying Eq. 100 with x and Eq. 101 with y and subtracting the results we get: 


2Qr0a7 Dy? _ 9 
er oe 
ne y? 
aa (103) 
Also, on multiplying Eq. 100 with x and Eq. 102 with z and subtracting the results we get: 
204"? Dz? 
a ° 
a 2 
or (104) 
On comparing Eqs. 103 and 104 we get: 
ay? 2? 
eo BP & 
On substituting from the last equation into Eq. 99 (once for ye and os once for = and aS and once 
for a and ve) we get: 
x? y? 22 
that is: 
a b ¢ 


v3 ae mee 


Again, from the symmetry it is obvious that x,y,z are just half the sides of the parallelepiped and 


hence the lengths of the sides of the parallelepiped are a (along the « axis), a (along the y axis), and 
a (along the z axis). Accordingly, the optimal volume of the parallelepiped is ye x = x = — ware 


(which can also be obtained from 8xyz which we used above). 
Note: the optimal volume in this Problem is a maximum because the volume approaches zero when 
one or two of the sides of the parallelepiped approach zero. 
2. What is the shape of the rectangular parallelepiped of optimal surface area that is inscribed inside a 
spheroid (i.e. ellipsoid of revolution) of a fixed shape? 
Answer: The equation of spheroid in standard form can be written as: 
a ye 


a2 b2 


=1 (105) 


If we follow in our footsteps in Problem 1 then we are required to optimize h = f+Ag = (wy + wz + yz)+ 
A (S + ve) and hence: 


Oho % 2Ar 0 
On ais az 
Oh 2ry 

ay = £+2z+ oe => 0 
Oh ae sek 2rAz 0 
Oz ree b? 


Now, from the symmetry we should have z = y and hence the last three equations will reduce to only 
two, that is: 


y+ = 0 (106) 
a 
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2N 
aty +> =a (107) 


On solving Eq. 106 for A we get A = wee and on substituting this in Eq. 107 we get: 


2a? y? 
PTO Be = 
2a2 2 
age = 0 


On solving the last equation for x (using the quadratic formula) we get: 


—yty/y2t+ Se (144/14 84 
ase = 


2 2 


where we ignored the non-sensible negative root. Now, on substituting from the last equation (plus 
z= y) into Eq. 105 we get: 


1 f-14+\/148¢ 


-1/2 
mgs 2 y f-lt4/it+ Bg" ) 
oS la 2 b? 
Accordingly, the inscribed parallelepiped of optimal surface area has dimensions: 
2 -1/2 
ee (asi (ae | | af eae) 
= 2 a? 2 e 
2 -1/2 
ae ane y f-l4/1+ Ba" 22 
a ee a2 2 ' b2 


and its optimal surface area is 0 = 8(xy + «z+ yz) where «, y, z are as given above. 
Note: the optimal surface area in this Problem is a maximum because the surface area can approach 
zero when two sides of the parallelepiped (i.e. those corresponding to the identical semi-axes) approach 
Zero. 

3. A spheroid is given by the equation x? + 3y? + 3z? = 1. Find the dimensions and the surface area of 
the inscribed parallelepiped of optimal surface area. 
Answer: On comparing this equation with Eq. 105 we get a? = 1 and b? = 1/3. Therefore, the 
dimensions and the surface area of the optimal parallelepiped are: 


2 
8 8 
-1+,/1+73\ | f-l+/lt+ds 9 
22 = -2 | 


2 1 2 


—1/2 
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2 
-14+,/1+ 7% 

1 1/3 2 
2y=2z = 2 Fl : 


2 


P ( 2 1 os 2-0 1 1 1 ) 4 
oO = } = 
V10V10 V10V10 V10V10 
4. What is the shape of the right circular cylinder of optimal surface area that is inscribed inside a sphere 
of a fixed radius? 


Answer: Let the radius of the sphere be R while the radius and height of the cylinder be r and h (see 
Figure 44). From the Pythagoras theorem we have h = 2/R? — r? and hence the surface area of the 


cylinder is: 


ee ee 


Figure 44: A cross section of a sphere of radius R in which a right circular cylinder of radius r and height 
h is inscribed. The cross section is through the center of the sphere (and the center of the cylinder) and 
along two generators of the cylinder. See Problem 4 of § 4.4. 


o = 2nr? + 2arh = 2nr? + dary R? — v2 = In Gs +2 Rr? — Z) 


To optimize o we take its derivative with respect to r (which is the optimization variable since R is 
fixed) and set it to zero to obtain the optimization condition, that is: 


1 do 
Oe hei} 
2n dr 
2Rer — 4r? 
2r + 2—— = _ 0 
a Rare — rt 
Rr — 2r 
r+ — = 0 
J R22 — 74 
(R?r _ 2r3)? 3 
Rea: 
Rr? —4R?r* + 4r® = R?r*—7® 


5r® —5R?r4+ Rtr? = 0 
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br? = 5R2r?+R* = 0 (r £0) 


On solving this equation for r? (using the quadratic formula) we get: 


1/2 
yale A 4. + + 
> BR? + V25R?—20RF (3) a ee ae (3) e 


= 10 a g 


Therefore: 


1/2 
54/5 5eV5 
=? 2_7r2=—9 2 ————_ 2—9 
h R . R ( 10 je n( 10 


On inserting the obtained expressions for r and h in the equation of the area (i.e. o = 2ar? + 2mrh) 


we get: 
1/2 = 1/2 
o = on (28) we an (2448 R an (238 


i 2 1/2 
= Dar? abv + 2 ete 

10 100 

5tV¥5\ 20 
= 2nR? — 
TR ( 10 + 5 | 
1/2 
Now, since the optimal surface area is a maximum (see the upcoming notes), we take r = (23) 


1/2 
and h=2R (5%) and hence the optimal surface area is: 


(334) V¥20 


o = 2nR? — | + | = (1 BS v5) R? ~ 10.1664R? 


5 


This is less than the surface area of the sphere which is 47 R? ~ 12.5664R? (as it should be). 
Note 1: the optimal surface area in this Problem is a maximum because o can approach zero when r 
approaches zero (and h approaches 2R).|8% 
Note 2: for more clarity, we plotted in Figure 45 the surface area o of a cylinder inscribed inside a 
unit sphere as a function of the cylinder radius r.!9° 

5. What is the shape of the right circular cylinder of optimal volume that is inscribed inside a sphere of 
a fixed radius? 
Answer: If we follow in our footsteps in Problem 4 then the volume V of the cylinder is given by: 


2 3 
V =a’ =ah(Re-) =1(n—) 


To optimize V we take its derivative with respect to h (which is the optimization variable since R is 
fixed) and set it to zero to obtain the optimization condition, that is: 
1 dV 
am dh 
[89] We should also note that the above optimal area lice. o=n(1l+ V5) R?] is also larger than the area in the other limiting 
case when h approaches zero and hence the surface area approaches 27 R?. 


199] The use of unit sphere does not affect the generality because this is equivalent to using length units scaled by 1 /R and 
using area units scaled by 1/R?. 
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12 
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Figure 45: Plot of the surface area o (of a cylinder inscribed inside a unit sphere) as a function of the 


cylinder radius r where the peak of the curve at (y/ 4x8, nm + mv/5) ~ (0.8507, 10.1664) is marked. See 
Problem 4 of § 4.4. 


2R 
v3 


Hence, the optimal cylinder has a height h = ya and a radius r = /R - he a (rR - x = Ry/3 


with an optimal volume V = 4% R°. 
Note: the optimal volume in this Problem is a maximum because V can approach zero when r 
approaches zero (and h approaches 2R) or when h approaches zero (and r approaches R). 

6. What is the shape of the right circular cylinder of optimal volume that is inscribed inside a right 
circular cone of a fixed shape? 
Answer: Let r and h be the radius and height of the cylinder and R and H the base radius and height 
of the cone (see Figure 46). Now, the triangles ABC and ADE are similar and hence: 

H-h 


TR a h=H(1-5) 


Thus, the volume of the cylinder is: 


3 
V =ar*h=ar'H (1- )) = 7H (- z) 


To optimize V we take its derivative with respect to r (which is the optimization variable since R and 
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Figure 46: A cross section of a right circular cylinder of radius r and height fh inscribed inside a right 
circular cone of base radius R and height H. The cross section is through the apex A of the cone and the 
diameter of its base (and hence through the center of the cylinder and along two of its generators). See 
Problem 6 of § 4.4. 


Hi are fixed) and set it to zero to obtain the optimization condition, that is: 


1dv 
nar 
pr 
H(a-3) = 0 
Ga), 
2-35 = 0 (H #0, r #0) 
2 
= =R 
° 3 


Thus, the cylinder of optimal volume has r = 2R and h = H (1 - 4) =H (1 - 5h) = = with an 
optimal volume V = m(ZR)yPt = RH. 
Note: the optimal volume in this Problem is a maximum because V can approach zero when h 
approaches H (or when r approaches R). 

7. What is the shape of the right circular cone of optimal volume that is inscribed inside a sphere of a 
fixed radius? 
Answer: Let R be the radius of the sphere and r and h be the base radius and height of the cone (see 


Figure 47). Now, r = Rcos@ and h = R+ Rsiné and hence the volume of the cone is: 


Ls 1 2.2 : aR? 2 20a 
VS grr re 37 (R? cos? 6) (R+ Rsin#) = nar (cos? 0 + cos” @ sin 6) 
To optimize V we take its derivative with respect to 0 (which is the optimization variable since R is 


fixed) and set it to zero to obtain the optimization condition, that is: 


3 dV 


aR? dd " 
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Figure 47: A cross section of a sphere of radius R inside which a right circular cone of base radius r and 
height h is inscribed. The cross section is through the apex A of the cone and the diameter of its base 
(and hence through the center C of the sphere). See Problem 7 of § 4.4. 


—2cos@ sin @ — 2cos@sin?6+cos?@ = 
—2sin@ —2sin?@+cos?@0 = 

—2sin6@ —3sin?6+cos?6+sin?@ = 
—2sind—3sin?@+1 = 
3sin?04+2sind-1 = 
(3sind—1)(sin?+1) = 


(cos @ # 0 since r £ 0) 


SO Or 1 1S 


Hence, sin é@ = 3 because sin = —1 is unacceptable (since r 4 0). Accordingly, cos@ = V1 — sin? @ = 
4/1 — 4 = aya and hence the optimal cone has r = 2v2 R and h = R+ = 4R with an optimal 
volume V = 47 (8R?) (4R) = 2 rR. 
Note: the optimal volume in this Problem is a maximum because V can converge to zero when h 
approaches 2R (or r approaches zero). 

8. What is the shape of the right circular cone of optimal volume inside which a sphere of fixed radius is 
inscribed? 
Answer: Let the radius of the sphere be r and the base radius and height of the cone be R and H 
(see Figure 48). Now, the triangles ABC and ADE are similar (since they are right triangles sharing 
an angle) and hence: 


, (H- rr? 
Ro H 

PP 3 H? — 2rH 
Ro H 

ry? H? — 2rH 
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Figure 48: A cross section of a sphere of radius r inscribed inside a right circular cone of base radius R 
and height H. The cross section is through the apex A of the cone and the diameter of its base (and 
hence through the center C of the circle). See Problem 8 of § 4.4. 


pen? = RAR One 
(Rae) <= QRH 
(R?—r?)H = 2rR? (H # 0) 
2r R? 
H = R2 — r2 


Now, the volume of the cone is: 
1 1 2r R? Qar Rt 
V = 57 R°H = =7R? = 
a" aC (gs) 3 (R? — 7) 


To optimize V we take its derivative with respect to R (which is the optimization variable since r is 
fixed) and set it to zero to obtain the optimization condition, that is: 


Bl ae 
27 dR 

4r R3 rR4 (2R) =, 
(=) (RP 

4r R3 (R? — 1?) aR yg 

(RP— 12)? (RR?) 
4rR°? —4r°R?—2rR? = 0 
ark? —4r°R® = 0 
R? 27? = 0 (r £0, R40) 
R = v2r 
So, the optimal cone has R = V2r and H = eet = 4r with an optimal volume V = $m (V2r)° (4r) = 


8.3 
gar. 
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Note 1: the optimal volume in this Problem is a minimum because V can diverge when H approaches 
2r (or when R approaches r). 

Note 2: for more clarity, we plotted in Figure 49 the volume V of a cone (in which a unit sphere is 
inscribed) as a function of the cone radius R.!9!) 
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Figure 49: Plot of the volume V (of a right circular cone in which a unit sphere is inscribed) as a function 


of the cone base radius R where the minimum of the curve at (/2, 8) ~ (1.4142, 8.3776) is marked. See 
Problem 8 of § 4.4. 


4.5 Solids of Revolution of Optimal Volume 


From the title of this section, it is clear that in this type of problems it is required to optimize the volume 
of a solid of revolution generated by revolving a certain curve [which is usually represented by a function 
y = y(x)| around a straight line (which is usually the x axis). As we will see, some of these problems 
can be solved by ordinary calculus without need for the variational formulation of the Euler-Lagrange 
equation. 
Problems 
1. Find and plot the parabola y(x) that passes through the points (0,0) and (1,1) and optimizes the 
volume generated by its revolution around the x axis. 
Answer: The equation of parabola is y = ax? + bx + c where a,b,c are constants and a 4 0. Now, 
since it passes through (0,0) then c = 0, and since it passes through (1,1) then a + b = 1 and hence 
b=1~-a. Therefore, the parabola should be given by y = ax? + (1—a)x. Now, the volume generated 
by the revolution of this parabola around the x axis between (0,0) and (1,1) is given by: 


1 1 
wey) ny de =n f [ax? + (1—a)a]” dx 
0 0 


[91] Again, the use of unit sphere does not affect the generality because it is a scaling issue. 
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So, if the volume is optimum then it should be stationary with respect to variations in the parameter 
a of the parabola (which controls the shape of the parabola and hence the volume), that is: 


ee es th 
e{f jac? + (1 =a) a)” ae} Soi (108) 


[ la aaa") oe (109) 
[ {2 [ax? +(1—a)a][x*—a]} dx = 0 


2 | [ax* + (1—a) 2° —ax*—(1—-a)a*] dx = 0 
0 
1 
[ [act + (1-20) 2° - 1-0) 2] dx = 0 
0 
1 
a5 (1-20) 4 (1-4) 3 - 9 
5 4 3 " 
a (1—-2a)_ (1—a) S 
E ml 3 [(0o+0-0] = 0 
12a + 15 — 30a — 20 + 20a 
= 0 
60 
2a—5 
= 0 
60 
5 
a= = 
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Therefore, the volume-optimizing parabola is given by: 


y=5e+(1 se 30" a 


This parabola is plotted in Figure 50. 
Note 1: regarding the differentiation of the integral (see Eqs. 108 and 109) we are using the fact that 
if f is a function of two variables (¢ and x) and 


B 
o)= ffx) de 
(with a and £ being constants) then 


dt 


do(t)  f? Of (t,x) 
[ AE dx 


In fact, we do not need this complication (i.e. differentiating the integral) because we can integrate 
first (without differentiation) to obtain an expression for the volume as a function of a, that is: 


Bo 
Ver (: aE “) (see Problem 2) 


30 


This expression can then be differentiated with respect to a and set to zero, that is: 


7 =1 (A) =0 and hence 2a—5=0 
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Figure 50: Plot of the parabola I that passes through the points (0,0) and (1,1) and optimizes the volume 
generated by its revolution around the x axis. See Problem 1 of § 4.5. 


which leads to a = 5/2 (as before). However, we followed this method (which requires differentiating 


the integral) for diversity and practice.|°?! 


Note 2: although the immediate objective here is to find a curve, the ultimate objective is to find the 
shape of a 3D solid object (of optimal volume) and hence the Problem is classified as an optimal solid 
issue. The sufficiency of finding a curve is due to the fact that we are dealing with a solid of revolution 
which is completely defined by its profile curve (plus the «x axis). 

2. Show that the solution that we obtained in Problem 1 is a minimum. 
Answer: It should be intuitively obvious that the solution in Problem 1 is a minimum because for 
some parabolas (passing through the designated boundary points) the volume can diverge. However, 


we can show this formally by calculating the volume (as a function of a) and plotting it, that is: 


I 


4 


1 
— rf la?x* + 2ax? — 2a2x 
ae: 
1 = 
if [a7a* + (2a — 2a 7) et 


i 
Jets (BES) as 
3 0 


lI 
> 


0 


1 
a fax? + (1—a)a]* dx 


2,3 


+2? — Qar? + ax” | dx 


(1 — 2a+ a’) z| dx 


= TT 


ae foe 
| 


Gara a 


1-2a+a? 
3 


10 — 


T 


= 6a? 15a — 15a? 
a 30 30 


20a + 10a? 
30 


192] In fact, this method may also reduce the required algebra and can help in simplifying the integration (which can be 


essential in some cases). 
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a? —5a+10 
= TT SEE 
30 


On plotting the volume as a function of a (see Figure 51) we can see clearly that the volume has a 
minimum (V = 7/8) at a = 5/2 and hence the solution is a minimum (rather than a maximum or 
inflection). 
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Figure 51: Plot of the volume V of the parabola of Problem 2 of § 4.5 as a function of a where the lowest 
point (5/2, 7/8) is marked. 


3. Find the cubic curve y(x) that passes through the points (0,0) and (1,1) and optimizes the volume 
generated by its revolution around the x axis. Also, plot the curve between # = 0 and x = 1. 
Answer: The equation of cubic curve is y = ax? + bar? +cx+d where a, b,c, d are constants and a £ 0. 
Now, since the curve passes through (0,0) then d = 0, and since it passes through (1,1) then a+b+c = 1 
and hence c = 1—a—b. Therefore, the curve should be given by y = ax? + ba? + (1—a—) a. Now, 
the volume generated by the revolution of this cubic curve around the x axis between (0,0) and (1, 1) 


is given by: 
1 
Vi= [mre 
0 
: 2 
= rf [ax® + bx? +(1—a—b)a]° dx 
0 


1 
| 
0 


azax® + 2abr? + (—2a? —2ab+b7 + 2a) vi + 


(—2ab — 2b? + 2b) et (a? + 2ab + b? — 2a — 2b+ 1) x? | dx 
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Now, if the volume is optimum then it should be stationary with respect to variations in the parameters 
a and b of the curve (since these parameters control the shape of the curve and hence the volume), 


that is: 
10V 16 1 4 
moa 05° 0 ie 
10V 1 1 1 
Roe 00 i a 
On solving this system of simultaneous equations we obtain a = 7 and b = —8 and hence c= 1—7+8 = 


2. Accordingly, the cubic curve is represented by the function y = 7x? — 8x? + 2x and it is plotted in 


Figure 52. 


Figure 52: Plot of the cubic curve y = 7x? — 8x? + 2x between x = 0 and x = 1. See Problem 3 of § 4.5. 
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Table 2: A table for the values of V as a function of a and 6 where a minimum (for V) can be seen at 
(a,b) = (7, —8). Refer to Problem 4 of § 4.5 for details. 


b 
-8.03 -8.02 -8.01 -8.00 -7.99 -7.98 -7.97 
6.96 | 0.2102937 | 0.2101157 | 0.2099586 | 0.2098225 | 0.2097073 | 0.2096130 | 0.2095397 
6.97 | 0.2100319 | 0.2098853 | 0.2097597 | 0.2096549 | 0.2095712 | 0.2095083 | 0.2094664 
6.98 | 0.2098180 | 0.2097028 | 0.2096086 | 0.2095353 | 0.2094829 | 0.2094515 | 0.2094410 
6.99 | 0.2096519 | 0.2095682 | 0.2095053 | 0.2094634 | 0.2094425 | 0.2094425 | 0.2094634 
a | 7.00 | 0.2095338 | 0.2094814 | 0.2094500 | 0.2094395 | 0.2094500 | 0.2094814 | 0.2095338 
7.01 | 0.2094634 | 0.2094425 | 0.2094425 | 0.2094634 | 0.2095053 | 0.2095682 | 0.2096519 
7.02 | 0.2094410 | 0.2094515 | 0.2094829 | 0.2095353 | 0.2096086 | 0.2097028 | 0.2098180 
7.03 | 0.2094664 | 0.2095083 | 0.2095712 | 0.2096549 | 0.2097597 | 0.2098853 | 0.2100319 
7.04 | 0.2095397 | 0.2096130 | 0.2097073 | 0.2098225 | 0.2099586 | 0.2101157 | 0.2102937 


4. Show that the solution that we obtained in Problem 3 is a minimum. 


Answer: It should be intuitively obvious that the solution in Problem 3 is a minimum because for 
some cubic curves (passing through the designated boundary points) the volume can diverge. However, 
we can show this formally by calculating the volume (as a function of a and 6 as given by Eq. 110) 
around (a,b) = (7, —8) where we can see (refer to Table 2) that the point (a,b, V) = (7, —8, 0.2094395) 
is a minimum. 

. Find the shape of the closed solid of revolution of fixed surface area and optimal volume.|9*1 
Answer: Because the surface is closed we can (without loss of generality) represent the generating 
curve [ by a function y (x) with its two end points (A and B) being on the x axis (and hence the 
surface is generated by rotating I around the z axis). We can also (without loss of generality) put the 
point A at the origin of coordinates. So, the setting of this Problem can be depicted schematically as 
in Figure 53. 

Our objective is to optimize the volume of this solid subject to the constraint on its surface area 
and hence we use the Lagrange multipliers technique (see § 1.8). Now, the volume is given by the 
integral [)” my? dx while the surface area is given by the integral [.2ryds = fj? 2ryy/1+ y? de. 
So, F = ry? and G = 27y\/1+ y? and hence H = F + AG = ry? + A27y\/14 y?. Using Beltrami 
identity (i.e. Eq. 3 with H replacing F’), we have: 


OH 
od aa => C 


(4? + Ary V/1 + y”) - Va (my? + AQryV1+ y”) ans 5 


9 / 
ny? + AWryV/1 + y? —y! (21s | = C 
y 
A2ryy’ 
ny? + AWryV/1 + y? — vies ane 
ee A2my (1+ y'?) — A2ryy? _o¢o 
/1+y!? 
A2TY 
2 = 
Taegan = 


[93] “Closed” here means that the end points of the revolving curve that generates the surface are fixed in space and hence 


they do not rotate. In other words, the end points are on the axis of revolution (i.e. the line which the curve revolves 
around). 
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Figure 53: A simple sketch depicting the setting of Problem 5 of § 4.5 where the curve I revolves around 
the x axis to generate a closed solid of revolution. 


Now, because the curve passes through the origin (where y = 0), C is zero. Hence: 


phe = eG 
/T +4 y” 
2A 
y+ ——— = 0 
eee y” 
2A 
14+y” epee 
y 
A? 
12 
ye? Sea 
y? 
dy 4)? — y? 
dx y 


—_ = ju 
4d? — y? 


—V 4? — y? rt+tC, 


Now, because the curve passes through the origin (where x = 0 and y = 0), C, = —2, and hence: 
—V 4 — y? x—2r 
AN —y? = (#—2A)° 
(2—2\)?+y? = 4)? 


This is an equation of a circle with center (24,0) and radius 2\. Therefore, the optimal solid of 
revolution is a sphere with center (2A, 0, 29) and radius 2\ (with zo being a given number representing 
the z coordinate of the center). 

Note 1: the optimal volume in this Problem is a maximum because V will converge to zero when the 
revolving curve (and hence the surface) approaches the axis of rotation (or when the two sides of the 
curve approach each other). 

Note 2: because y(x) is presumed to be a function over its domain, we exclude one-to-many curves. 
However, this will not affect the generality of the result noting that the optimal volume is a maximum 
(not a minimum). 

Note 3: in any specific problem of this type, 4 can be determined from the given area o of the surface. 
For example, if the given area is 167 then we have: 


o = lor 


4.6 Solids of Revolution of Optimal Surface Area 157 


4n (2A)? = 160 
167\2, = = 160 
A = 1 


where in line 2 we used the formula for the area of sphere (i.e. o = 47R?). Accordingly, the equation 
of the circle is (x — 2)” + y2 =4 and the equation of the sphere is (x — 2)? + y2 + (z — 2)? =4. 


4.6 Solids of Revolution of Optimal Surface Area 


From the title of this section, it is clear that in this type of problems it is required to optimize the surface 
area of a solid of revolution generated by revolving a certain curve [which is usually represented by a 


function y = y(x)] around a straight line (which is usually the « axis). 


Problems 


1; 


What is the shape of the solid of revolution of fixed volume and optimal (i.e. minimum) surface area? 
Answer: It should be sphere. This is a consequence (or corollary) of the result of Problem 5 of § 4.5 
because if the solid of the given volume (say Vo) and minimum surface area (say 09) was not sphere 
then the sphere of the volume Vo should have larger surface area than oo and hence the sphere of 
surface area 09 should have smaller volume than Vo and this contradicts the result of Problem 5 of § 
4.5 because the sphere of surface area o9 should have maximum volume (in comparison to any other 
solid of revolution of the same surface area).!%4I 


4.7 Solids of Optimal Surface Area 


In this type of problems it is required to optimize the surface area of solids of arbitrary shapes subject to 
certain condition(s) such as having fixed volume. 


Problems 


1, 


What is the shape of the solid of fixed volume and optimal (i.e. minimum) surface area?! 

Answer: This is an area optimization problem with a volume constraint. However, we do not need to 
use the Lagrange multipliers technique or the formalism of the calculus of variations directly. What we 
will use in this answer is an intuitive approach based on the use of the calculus of variations indirectly, 
i.e. as applied to Problems of lower dimensionality that we investigated earlier. This will be clarified 
in the following discussion. 

Let have a solid of arbitrary shape which has a given volume. Suppose that we divide this solid into 
infinitesimal slices each of width ds and arbitrary cross sectional shape. The volume of each one of 
these slices is obviously the product of the cross sectional area times the width ds. Now, according to 
Problem 11 of § 2.4 the shape of the closed plane curve of shortest length that encloses a given area is 
a circle and hence the slice can be replaced by a circular disc of width ds and of the same volume (and 
hence the same cross sectional area). Accordingly, the surface area of the circular disc (which is equal 
to its perimeter times ds) is smaller than the surface area of the original slice (because the perimeter 
of the disc is shorter while ds is common to both), and since the surface area is the sum of the surface 
areas of the slices then the surface area of the original solid is larger than (or equal to) the surface area 
of the stack of discs. Now, the stack of discs (assuming that they are centered on a common axis) is 
a surface of revolution whose volume is the same as the volume of the original solid while its surface 
area is less than (or equal to) the surface area of the original solid. Referring to Problem 1 of § 4.6 
the solid of revolution of a given volume and minimum surface area is a sphere, and hence the surface 
area of the sphere of the given volume should be less than (or equal to) the surface area of the stack of 


194] For sphere, V = $rR3 and o = 4rR? (where V is its volume, o is its surface area, and R is its radius) which can be 


(<)°” 


combined to obtain V = 4n 


3 and hence the volume increases/decreases as the surface area increases /decreases 
(and vice versa). 


195] “Solid” here means 3D continuous ob ject with no cavity. 
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discs. Accordingly, the surface area of the sphere of the same volume should be less than (or equal to) 
the surface area of the original solid, i.e. the surface area of the sphere is the minimum of all surface 
areas of all solids of the same volume. So, the shape of the solid of fixed volume and optimal (i.e. 
minimum) surface area is sphere. 
Note: the optimal surface area in this Problem is a minimum because the surface area of some 3D 
shapes will diverge when one of the dimensions approaches zero (noting that the volume is fixed). 
This can also be concluded from the results of the Problems that we used in our answer of the present 
Problem. 

2. Find the shape of the solid of minimum surface area with a volume V = 367. Also, find the surface 
area of this solid. 
Answer: From the answer of Problem 1 the solid should be sphere. From the formula of the volume 
of sphere of radius R we have V = anR? = 367 and hence R = 3. Therefore, the solid is a sphere of 
radius R = 3. Also, from the formula of the surface area o of sphere of radius R we have o = 47 R? = 
dn x 3? = 367. So, our solid is a sphere of surface area 367 (area units) and volume 367 (volume 
units). 


4.8 Solids of Optimal Volume 


In this type of problems it is required to optimize the volume of solids of arbitrary shapes subject to 
certain condition(s) such as having fixed surface area. 


Problems 


1. What is the shape of the solid of fixed surface area and optimal (i.e. maximum) volume?! 
Answer: This is a volume optimization problem with an area constraint. We can follow a similar 
approach to that used in Problem 1 of § 4.7 to show that the shape is a sphere. However, it is easier 
to use the result of Problem 1 of § 4.7 (in conjunction with the proof by contradiction method) to 
obtain this result because if the solid of the given surface area (say oo) and optimal (i.e. maximum) 
volume (say Vo) was not sphere then the sphere of the surface area go should have less volume than 
Vo and hence the sphere of volume Vo should have larger surface area than oo and this contradicts the 
result of Problem 1 of § 4.7 because the sphere of volume Vp should have minimum surface area (in 
comparison to any other solid of the same volume).!97 
Note: the optimal volume in this Problem is a maximum because the volume of some 3D shapes will 
converge to zero when one of the dimensions approaches zero (noting that the surface area is fixed). 

2. Find the shape of the solid of maximum volume with a surface area o = 167. Also, find the volume of 
this solid. 

Answer: From the answer of Problem 1 the solid should be sphere. From the formula of the surface 
area of sphere of radius R we have o = 41 R? = 167 and hence R = 2. Therefore, the solid is a sphere of 
radius R = 2. Also, from the formula of the volume V of sphere of padi R we have V = $7R3 = Sr. 


So, our solid is a sphere of surface area 167 (area units) and volume 7 (volume units). 


4.9 Solids of Revolution of Optimal Resistance to Fluid Flow 

In this section we present and solve some variational problems related to solids of revolution whose 
resistance to fluid flow is optimized. 

Problems 


1. Let have a solid of revolution that is totally submerged in a fluid and it is in a state of uniform motion 
relative to the fluid along its axis of symmetry. Find the shape of this solid such that the resistance to 


[96] Again, “solid” here means 3D continuous object with no cavity. 
197] For sphere, o = 47 R? and V = $rR3 (where o is its surface area, V is its volume, and R is its radius) which can be 
ay \ 2/3 
combined to obtain 0 = 47 (3%) and hence the surface area increases/decreases as the volume increases/decreases 


(and vice versa). 
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Ya Flow 


Figure 54: A simple sketch depicting the setting of Problem 1 of § 4.9 where a solid of revolution offers 
minimum resistance to fluid flow. 


the flow is minimum. Use the assumption that the resistance R experienced by such a solid is given 
by: 


L 
R« ‘| yy’? dx 
0 


where the profile of the object is described by the function y = y(«) with the boundary conditions 
y(0) = 0 and y(L) = R and where the flow is in the positive x direction (see Figure 54). 
Answer: From the given information we can write: 


i 
r=c | yy? dx = Ily] 
0 


where C' is a constant, R (which is the resistance to fluid flow) represents the functional integral I 
that should be minimized, and y (which is the profile of the solid of revolution and hence it completely 
determines its shape subject to the boundary conditions) represents the minimizing function that we 
want to find. Accordingly, F(z,y,y’) = yy’? which is independent of 2 and hence we can use the 
Beltrami identity (ie. Eq. 3), that is: 


) 
(yy’*) _ Vay (yy’*) 


yy —y' (3yy") = C 
yy” — 3yy% = C 
yy = C1 (C1 = —C/2) 
yy = Cy (C2 = Or) 
y/ dy = Codz 
oy = Oor+03 
y*/3 = Cyr+Cs 
y = (Cix+C;)°"* 


Now, from the boundary condition y(0) = 0 we have Cs = 0 and from the boundary condition y(L) = R 
we have Cy = R4/3 /L. Hence, the shape of the resistance-minimizing solid of revolution is given by its 
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profile curve: 


R4/3 aS av \ 3/4 
an ( L °) eas (z) 
2. Re-solve Problem 1 but assume this time that the resistance R experienced by the solid object is given 
by: 


L 
R « f yy”? dz 
0 


Also, plot the profile of the object assuming that the boundary conditions are y(0) = 0 and y(2) = 1. 
Answer: Following in our footsteps in Problem 1 we have F(z,y,y’) = y?y’? and hence the Euler- 


Lagrange equation in this case is (see Eq. 3): 


(yy) — U5 yy") = 
yy? —y' (y*y') = C 
yy? — yy? = C 
yy” = Cy 
yy = Cy 
ydy = Codz 
wv = Cor+ C3 


YY = 2y Cyn a Cs 
Now, from the boundary condition y(0) = 0 we have Cs = 0 and from the boundary condition y(2) = 1 
we have Cy = 1/2. Hence, the profile of the object is given by y = +,/a/2 and it is plotted in Figure 
55. 


Figure 55: A simple sketch depicting the profile y = +,/2/2 of the solid of revolution of minimum 
resistance to fluid flow according to Problem 2 of § 4.9. 


Chapter 5 
Geometrical Optics 


One of the main foundations of the classical geometrical optics is Fermat’s principle which states that light 
travels along paths that take least time, and hence this principle is essentially a variational principle. For 
example, in a vacuum or in a uniform medium the light should propagate along straight paths because 
least time (according to Fermat’s principle) implies least distance (noting that the shortest paths in 
Euclidean spaces are straight lines and that the speed of light is fixed). Although this principle is not 
strictly correct!®*! and does normally apply only in geometrical optics (where the path of light is much 
longer than its wavelength), it still has many useful applications. 

In this fairly short chapter we investigate a number of variational Problems related to common (and 
simple) applications of geometrical optics. These Problems are essentially based on Fermat’s principle 
of least time. It is noteworthy that some of these Problems are so simple that they can (and will) be 
solved by ordinary calculus with no need for the variational formulation of the calculus of variations (as 
represented by the Euler-Lagrange equation in its various forms and shapes). 


Problems 


1. Use Fermat’s principle to conclude that light in vacuum and homogeneous media follows straight path. 
Answer: This rather trivial result can be easily concluded from Fermat’s principle because in Eu- 
clidean space (which is the space of classical geometrical optics) the shortest path connecting two 
points (directly) is straight line (see § 2.1). Now, since in vacuum and homogeneous media the speed 
of light is constant then least time (according to Fermat’s principle) means shortest distancel®®! which 
is the (length of) straight line according to the aforementioned Euclidean geometrical fact. 

2. Generalize the result of Problem 1 to include non-Euclidean spaces. 

Answer: The equivalent to “straight line” in Euclidean spaces is “geodesic” in non-Euclidean spaces 
(or indeed more general). So, by the logic of Problem 1 Fermat’s principle should lead to the conclusion 
that light in vacuum and homogeneous media follows geodesic paths in general (i.e. both in Euclidean 
spaces and in non-Euclidean spaces).|!°0l 

Note: it should be noted that in this Problem (as well as in Problem 1) we are assuming direct 
propagation of light without being affected by subsidiary phenomena (like reflection) that divert the 
light from its original geodesic trajectory. 

3. Derive the law of light reflection (i.e. the angle of incidence is equal to the angle of reflection) using 
Fermat’s principle. 

Answer: In this Problem the light is presumed to propagate in a single uniform medium (or in 
vacuum) and hence it has a constant speed throughout its journey from the point of departure A to 
the point of destination C passing through the point of reflection B (see Figure 56).!!°1 Accordingly, 
least time (according to Fermat’s principle) is equivalent to least distance of travel. So, what we need 
to do to solve this Problem is to find the path ABC of minimum length by considering the variations 


98] In fact, this principle may be rectified by changing it from being a minimization principle (as implied by least time) to 
be a principle for obtaining stationary points (where these stationary points can be minimum or maximum or inflection 
or saddle). The details (which are essentially of physical significance and hence they are of little interest to us as 
mathematicians of variation) should be sought in the literature. 

99] This is due to the direct relation between distance and time, i.e. d = ct where d is distance, c is speed and t is time. 

100] In fact, we need to provide a rigorous definition for “homogeneous” in non-Euclidean spaces (at least over certain patches 

of the space). We should also consider possible position dependency of the speed and its local and global significance. 

101] The presumption that the path between A and B and between B and C is straight is based on the result of Problem 

1. So, in reality we are applying Fermat’s principle twice: once on the sub-paths AB and BC, and once on the entire 
path ABC. In other words: in this Problem we are using Fermat’s principle in both propagation (along AB and BC) 
and reflection (along ABC). 
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Figure 56: A simple sketch depicting the setting of the derivation of the law of light reflection (see Problem 


3 of § 5). 


of the angle of incidence 6; and the angle of reflection 02 (noting that these angles vary as the point of 
reflection B varies and hence they are functions of the position of B and are correlated to the length 
of the path ABC). In fact, this Problem can (and will) be solved using simple geometry, algebra and 
ordinary calculus (supported by the variational principle) with no need for the calculus of variations 
(as normally represented by the Euler-Lagrange equation). 

Now, the length s of the path ABC is the sum of the length of AB and the length of BC where each 
one of these lengths can be obtained from the Pythagoras theorem, that is: 


sa fertyty(L-2) +¥ 


So, by the variational principle s should be stationary at its extremum and hence: 


ds 
dx 
2x , —2(L-2) 
Vat ut a/(E x) +43 
x (L— 2) 
Jer +y? (b-aP +8 


sin 0; — sin A 


sin 6; 


which is the law of light reflecti 


va 


on. 


0 (see Figure 56) 


5 (considering the restrictions 0 < 01,02 < 5) 


Note 1: it should be obvious that the optimal path in this Problem is a minimum because the path 
of light (and hence the time of travel) can diverge when point B moves far away.!!°7] However, in the 


[102] As we noted earlier in a similar context, although this sort of arguments may not rule out the possibility of a local 
maximum it is still useful in supporting our intuitive conclusion (noting the secondary importance of issues like this for 


our variational objectives). 
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following we establish this technically by testing the sign of the second derivative of s, that is: 


Ps 1 a | 1 (hae 
we Vern ti? faa +d [oats] 
_ @tyi-a® | b-2) +8 -(b-2)’ 
wR? [(o-ay? +8)” 
a yi 2 >0 
+? [2+] 


So, the optimal s is a minimum and hence the obtained result is consistent with Fermat’s principle 
(even in its restricted form as a principle of least time). 

Note 2: in the above formulation and arguments we assumed implicitly that the path of light is in a 
plane that is perpendicular to the plane of the mirror (where the latter plane is the tangent plane if 
the mirror is not flat). This may also be justified by Fermat’s principle (noting that the path in other 
planes is not optimal relative to the path in the perpendicular plane even if the condition 0; = 62 is 
satisfied). 

4. A light ray is emitted from point (3, 12,5) [in a 3D Euclidean space coordinated by a Cartesian system] 

toward a flat mirror in the zy plane. If the reflection of this ray is required to pass through the point 
(9, 13, 45), what the initial direction of the ray should be? 
Answer: If the mirror does not exist then it is required (due to the symmetry and the law of reflection) 
that the ray should pass through the point (9,13,—45). Hence, the initial direction should be along 
the line passing through the initial point (3, 12,5) and the final point (9,13, —45), i.e. along the vector 
(9 — 3,13 — 12, —45 — 5) = (6,1, —50). 

5. Derive Snell’s law of light refraction using Fermat’s principle. 

Answer: Snell’s law of refraction states that the path of a light ray crossing the boundary between 
two propagation media of different refractive indices!!! satisfies the following relation (see Figure 57): 


sin 6; Cy no 


sinfOg co my 


where 6; and 62 are the angles of incidence and refraction, c; and c2 are the speeds of light in medium 
1 and in medium 2, and n, and ng are the refractive indices of medium 1 (where the source of light) 
and medium 2 (where the receiver of light). So, let see how this law can be derived from Fermat’s 
principle. 

The length s of the path ABC is the sum of the length s; of AB and the length s2 of BC where each 
one of these lengths can be obtained from the Pythagoras theorem, that is:!!4! 


s=sts= fe? +y3 + ((b-2) +¥3 


The total time of travel ¢ (which we should minimize according to Fermat’s principle) is the sum of 
the time ¢; along AB and the time tz along BC, that is: 


2 2 
a ee gary ee ett Seite 
C1 C2 Cy C2 


[103] The refractive index n of a medium (for light propagation) is the ratio of the speed of light in vacuum c to the speed 
of light in that medium cm, that is n = c/cm. We should note that both media are presumed optically homogeneous; 
moreover one of the media (and possibly both as a special case) could be vacuum (whose refractive index is 1). 

[104] Again, the presumption that the path between A and B and between B and C is straight is based on the result of 
Problem 1. So, in this Problem we are actually using Fermat’s principle in both propagation and refraction. 
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Figure 57: A simple sketch depicting the setting of Problem 5 of § 5 regarding the derivation of Snell’s 
law of light refraction. 


Now, by the variational principle t should be stationary at its extremum and hence: 


dt 
Bere oe. GQ) 
dx 
x (L— 2x) a, 
cir? + y? C9 (L—2)? +42 
ind sin 0 
PES Eee a 50 (see Figure 57) 
C1 C2 
sin 6; ey 
sin Ag 7 C2 
sinf; —— e/ny 
sin@z  —c/ng 
sin 0, ne 
- 4 111 
sin O5 Ny ( ) 


which is Snell’s law of light refraction. 

Note: again, the optimal path in this Problem is obviously a minimum because the path of light 
(and hence the time of travel) can diverge when point B moves far away. This can also be established 
technically by testing the sign of the second derivative of t (as done for s in Problem 3). 

6. A light ray is emitted from point (5,3) [in a plane coordinated by a Cartesian system] and it is required 
to reach point (10, —23). If the refractive index for y > 0 is ny = 1.2 and the refractive index for y < 0 
is n2 = 1.35, what are the coordinates of the point on the boundary through which the ray passes? 
Also, what are the angle of incidence 0; and the angle of refraction 02? 

Answer: Referring to Problem 5 (and Figure 57), we have L = 10 —5 = 5, sind; = 
5-2@ 
J (5-2)? +(—23)?” 


zx 
Jers and 


sin A = 


Hence, from Eq. 111 we get: 


r V6 — x) + (-23) 
Jae 5a 


1.125 
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x ~ 0.6424877425 


r 


So, the coordinates are (5.6424877425, 0). Also: 


6, =~ arcsin ( USE US ) ~ 0.210976 rad ~ 12.088012° 
V/0.64248774252 + 32 
6. = arcsin eee ~ 0.187271 rad ~ 10.729814° 


V (5 — 0.66424877425)" + (—23)° 


7. Find the shape of the path traced by a ray of light moving in a material medium whose refractive 
98] 


index is proportional to y (y > 0; see Figure 58 


O a 


Figure 58: A schematic illustration of the setting of Problem 7 of § 5 where a light ray moves from point 
A(az4,ya) to point B(«g, yg) along the path I (which is confined to the xy plane) in a material medium 
whose refractive index is proportional to y (i.e. n = ay with a being a positive constant). 


Answer: According to Fermat’s principle the time ¢ for the propagation of the light ray between point 
A (source) and point B (destination) should be a minimum. So, our minimized functional I[y] should 
be an integral representing t¢, that is: 


d 2B 1 12. zB 1 12d. cB 
i= fa-[S-/ yore =| von atl jee 
T r Uv LA Cm x x 


a ef (ay) Cha, 


where in step 2 we used v = ds/dt (and hence dt = ds/v), in step 3 we used ds = \/1+y/2dx and 
UV = Cm (with cp, being the speed of light in the material medium), and in step 4 we used the fact 
that the speed of light c,, in a material medium with refractive index n is given by c/n [noting that 
in this case n = ay (with a being a positive constant) because the refractive index is proportional to 


1105] In this Problem (and other similar Problems which will follow) we are assuming that the speed of light (and hence 
the refractive index) is a function of position but not of direction, i.e. the medium is optically isotropic although it 
is not homogeneous. We also consider the path of the ray to be confined to the xy plane. It should also be noted 
that the condition y > 0 does not impose any real restriction on the formulation of this Problem (neither physical 
nor mathematical) because we can always choose our coordinate system so that the entire path is in the region y > 0 
(although y < 0 or using a negative proportionality constant may be dealt with as a different problem). 
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y]- As we see, F' in this Problem is identical to F' in the problem of catenary (see Problem 1 of § 2.3). 
Accordingly, the shape of the light path should also be a hyperbolic cosine, that is: 


D 


peek (=) (112) 


8. Find the shape of the path traced by a ray of light moving in a material medium whose refractive 
index is proportional to 1/y (y > 0; see Figure 59). 


O 6 


Figure 59: A schematic illustration of the setting of Problem 8 of § 5 where a light ray moves from point 
A(ava4,ya) to point B(awg, yz) along the path I (which is confined to the xy plane) in a material medium 
whose refractive index is proportional to 1/y (i.e. n = b/y with b being a positive constant). 


Answer: On following the reasoning and method of Problem 7 we get: 


ee eee Vit+y?dx _b 2 VITT iy 
T ity U Cm LA c/( b/y) “LA 


/ 12, 
where b is a positive constant. As we see, fF = we and hence the Euler-Lagrange equation (see 
Eq. 3) is 


VET 12 (4) — 


y Oy’ y 
Mg y! _ oC 
y y 1+y? 
Jl+y? y? _ oC 
y yVf1l+y? 
1l+y” y? = C 
yV1l+y? yV1 + y?2 
1 
en 
yfit+y? 
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1 
(1+y") Cc 
12 1- Cy? 
Yo C2y? 
ne a2 aie = Cty 
y= SF C2y? 
C2y2 
4/7 6 a ae = dz 
aie = £+D 
1 
1 
Co i ele (113) 


So, the shape of the light path is a circular arc centered on the x axis. 

9. Find the shape of the path traced by a ray of light moving in a material medium of refractive index 
n=e¥ (y>0). 
Answer: Following in our footsteps in the previous Problems we get: 


d 2B 1 12. cB 1 2d. 1 cB 
rf a=[S=/ vere -/ eee =i i. /lagide st 
T Tt Uv LA Cm LA c LA 


c/eY 


As we see, F = e¥\/1+ y’ and hence the Euler-Lagrange equation (see Eq. 3) is 


Lye ay (evi +y?) =. 
2 
ev/1+y” ey: = C 
1 yl? 
TU SEES 2. Bes 
/1+y'" | ce yl? a 
eY 
1+y ae 
ev = C(1+y?) 
oe ha. 2y _ C2 
y — C2 
y = Ve 
= ae 
C dy = 
Vev-C2 
2y _ C2 
arctan (“| = r+D (114) 


10. Find the shape of the path traced by a ray of light moving in a material medium of refractive index 
n = a/p where a is a positive constant and p is a polar coordinate (assuming the path being in a plane 
coordinated by polar coordinates p, ¢). 

Answer: Following in our footsteps in the previous Problems (but using polar coordinates) we get: 


_ fds coe + p2(dd)? pow 4/1 4p? (db/dp)? 
(ad a 


c/ (a/p) 
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a Vit Pd? | =] 
PA 


p= 1/4) 
c p 
where in step 3 we used the expression for the line element ds in polar coordinates and v = cm. AS 


we see, F'(p,¢,¢') = p-'\/1 + p?2¢ and hence the Euler-Lagrange equation (see Eq. 4 noting the 
correspondence between x,y, y’ and p, ¢, ¢’) is: 


: (o Vite?) = C 


de! 
oo pd! 2 a 
18 p29"? 
Oe 
_ p29"? 
pe” = C2 (1 + p’d) 
2 
gr = P2 —_ P 
ga (D=+V@/l- C7) 
p 
¢ = Dinp+E (115) 


11. Find the shape of the path traced by a ray of light moving in a material medium of refractive index 
4 13 
n= ae where a is a positive constant, p is a polar coordinate (assuming the path being in a 
p 


plane coordinated by polar coordinates p,¢) and ¢' = d¢/dp. 
Answer: As in the previous Problem, we have: 


i= [af ee ANAL VIF PPP dp _ 
r rv rE Cm 


ap* $3 


a f°? aig = 
ake pd! dp = Id) 


As we see, F' (p,¢,¢’) = p*¢ and hence the Euler-Lagrange equation (see Eq. 4 noting the corre- 
spondence between x,y, y’ and p, ¢, ¢’) is: 


0 
aa (o'¢'°) =e a: (C > 0) 
3p'¢”? = C 
12 — sCe 
g- = 301 
! = C/3 
7) = pe 
D 
oe! See (p = C73) (116) 
p 


12. A light ray is emitted from point (7,11) [in a plane coordinated by a Cartesian system] and it is 
required to reach point (8,37). If the refractive index of the medium of propagation is proportional to 
y, what the initial direction of the ray should be? Also, plot the trajectory between the two points. 
Answer: According to the result of Problem 7, the path is given by Eq. 112 and hence the points 
(7,11) and (8,37) should satisfy this equation. On solving Eq. 112 for x and substituting the two 
points in the resulting equation we get: 


11 
= sh | — D 
7 C arccos ( =) + 
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8 = Carcoosh (F ) + D 


On subtracting the first of these equations from the second we get 1 = C' arccosh (35) — Carccosh (4) 
which can be solved numerically to obtain C' ~ 0.823517739. On inserting this value of C’ into one of 
the above equations we get D ~ 4.295725460. So, the trajectory of the light ray is given by: 


a — 4.295725460 
= 0.82351 sh | 93517739 — 
y = 0.823517739 cos ( 0.823517739 ) 


Accordingly, the initial direction can be determined from the slope of the trajectory at point (7,11), 
that is: 


; (< — 4.295725460 
= sinh 


eae | me ~ 13.319846963 


c=7 
The trajectory is plotted in Figure 60. 
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Figure 60: Plot of the trajectory y = 0.823517739 cosh (ae e100) of the light ray of Problem 12 of § 
5. 


13. A light ray is emitted from point (7,11) [in a plane coordinated by a Cartesian system] and it is 
required to reach point (8,37). If the refractive index of the medium of propagation is proportional to 
1/y, find the equation of the trajectory of the ray. 

Answer: According to the result of Problem 8, the trajectory is given by Eq. 113 and hence the 
points (7,11) and (8,37) should satisfy this equation, that is: 


1 
(7+D)? +1? = a 
(8+ D)* +37? = : 


14. 


15. 
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On subtracting the first of these equations from the second we get (8 + D)” +37?—(7+D)’—11? =0 
which can be solved to obtain D = —1263/2. On inserting this value of D into one of the above 
equations we get C = ,/4/1560485. So, the equation of the trajectory is: 


1263\* 4 1560485 
ee ee ae ee 


which is a circular arc (as concluded in Problem 8). 

A light ray is emitted from point (0, 1) [in a plane coordinated by a Cartesian system] in the direction 
(1,2). If the refractive index of the medium of propagation is n = e¥, determine if the ray will pass 
through the point (0.3,2). Also, plot the trajectory for the range y = 1 to y = 5. 

Answer: According to the result of Problem 9, the slop of the trajectory is y’ = vee So, at 
point (0,1) the slope is: 


y - rai 
2 e2 — C2 
i C 

4G? = (Ores 


So, from Eq. 114 the trajectory is given by: 


ey — (e//5)° 

arctan = z£+D 
e/V/5 

arctan (v 5et(y-1) — 1) = £+D 


Now, since the ray is emitted from point (0,1) this point should satisfy this equation, that is: 


arctan (v 5e2(1-1) — 1) = 0+D 


D = arctan (2) 
Accordingly, the trajectory is given (implicitly) by: 
arctan ( 5e2y-1) — 1) = «+ arctan (2) 
a arctan ( Bery-1) — 1) — arctan (2) 


The point (0.3, 2) does not satisfy this equation and hence it is not on the trajectory (but it is very 
close). The trajectory for the range y = 1 to y = 5 is plotted in Figure 61. 

Make a polar plot for the trajectory of the light ray in Problem 11 for the range ¢ = 7/3 to 6 =7 
assuming the ray passes through the points with polar coordinates (3,7/3) and (1,7). 

Answer: Inserting the coordinates of the given points in Eq. 116 we get: 


T ae 
3 3 
qr = D+E 


On subtracting the first equation from the second we get = = 22 and hence D = m. On inserting this 


value of D into one of the above equations we get E = 0. Hence, the equation of the trajectory is: 


i=, (S<¢sz) 


The plot is shown in Figure 62. 
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0.5 


Figure 61: Plot of the trajectory x = arctan (v 5e2(y-1) — 1) — arctan (2) of the light ray of Problem 14 
of § 5. The point (0.3,2) which is not on the trajectory (but it is very close) is marked. 


90 


240 


270 
Figure 62: The polar plot of the trajectory p = $ of the light ray of Problem 15 of § 5 for the range 


@ = 7/3 to 6 =7. The numbers on the perimeter are the polar angle ¢ in degrees while the numbers 2 
and 4 are the p = 2 and p = 4 circles. 


Chapter 6 
Hamiltonian Mechanics 


One of the biggest fields of application (as well as development) of the mathematics of variation (and the 
calculus of variations in particular) is mechanics where this subject in its variational form was historically 
developed by William Hamilton and hence it is commonly known as the Hamiltonian mechanics.!!9°l In 
the Hamiltonian mechanics distinctive terminology is used. For example, the functional integral J in 
this mechanics is called the “action” and hence the principle of minimizing this integral (which essentially 
reflects the spirit of the variational principle) is called “Hamilton’s principle of least action”.!!97] Distinctive 
notation is also used in the formulation of the variational problems. For example, in the variational 
problems with n extremizing functions (see § 1.7) the independent variable is usually denoted with t 
(which normally stands for time) and the extremizing functions are denoted with qi(t),--- ,dn(t) while 
overdot is used to symbolize the derivatives of these variables with respect to t.!!8! Also, the integrand 
F in the Hamiltonian mechanics is usually denoted with L and is called the Lagrangian (or Lagrangian 
function). Hence, the action integral is given as: 


te 
Tas sa) = f LE (t,91,°** 5Qns415°** s Gn) dt (117) 
ti 
Accordingly, a set of n Euler-Lagrange equations is required where these equations are given compactly 
by: 
OL d/oL 
es Ef we ell) j= 1,--- 118 
a(S) (= 1-0) (118) 
This set of n second order differential equations (which are commonly known as Lagrange’s equations or 
Lagrangian equations) should be solved to obtain the solution of the variational problem (i.e. q@1,--- , dn) 


where the initial conditions can be used to determine the 2n constants of integration that are involved 
in the solution (noting that in some cases the Hamiltonian formulation can lead to first integrals). It is 
worth noting that Eq. 117 may be written in a compact form (using vector notation) as: 


ral = | L(t,a.4) at (119) 


t1 


[106] T¢, may also be called the Lagrangian mechanics. In fact, Hamilton is not the only or the first mathematician to work 
on this topic but we attributed the development to him because in this book we are interested in his version of the 
variational formulation of this field. 

1197] Some authors suggest that Hamilton’s principle is closely related to the principle of least action but it is not the same. 
More specifically, Hamilton’s principle is seen as more general than the principle of least action. Also, being minimizing 
(as suggested by “least”) is not guaranteed by this principle and hence it may be more appropriate to call it the principle 
of stationary action (as will be indicated later). In fact, there are many details about these issues and their alike but 
they are beyond the scope of this book. 

[108] We should note that the extremizing functions q; in the Hamiltonian mechanics usually represent the spatial coordinates 
(and hence their time derivatives q; represent the velocity components) of a mechanical system with n degrees of freedom 
where the configuration of the system is described by these n functions and their derivatives. So, the purpose of the 
variational formulation is to describe the motion of the system and determine its configuration (as a function of time) as 
a result of the external and internal forces which the system is subject to. In fact, the configuration of the system can be 
seen as a t-parameterized “curve” in an n dimensional space. Classical systems are deterministic and hence by knowing 
the configuration at an initial time (by knowing the positions and velocities) it can be determined at any time in the 
future (in fact this may be generalized by saying: by knowing the configuration at a given time it can be determined 
at any other time). So, the variational formulation of a given mechanical problem plus the initial conditions should be 
able to solve the problem completely. 
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where q = (q1,°+- ,@n) and q = (q1,--- , Gn). This notation is particularly useful when the above discrete 
system formulation (with a finite number of degrees of freedom n) is generalized to the continuous system 
formulation (with an infinite n). 

The Hamiltonian mechanics is based on Newton’s laws (or on the Newtonian mechanics to be more com- 
prehensive) in its physical framework while it employs the calculus of variations as its main mathematical 
technique. The fundamental principle of this mechanics is the aforementioned “Hamilton’s principle of 
least action” which states: in a mechanical system subject to conservative forces only the behavior of the 
system (according to the Newtonian mechanics) is described by an extremall!°l of the action integral 
I = [ Ldt where the Lagrangian is given by L = T — U (with T and U being the kinetic and potential 
energy of the system respectively).!!10l 

We should finally draw the attention to the following important remarks: 

e There are two important generalizations to the Hamiltonian formulation. First, qg;’s are not necessarily 
required to be representing conventional coordinates (i.e. they can be used more generally to represent 
the variables and physical conditions of the mechanical system and hence they may be called generalized 
coordinates). Second, as indicated earlier the Hamiltonian formulation is not necessarily required to be 
for discrete systems (e.g. systems of separate particles) and therefore the Hamiltonian mechanics can 
be used to formulate even problems of continuous systems such as fluid dynamics and other continuum 
mechanics systems (see Problem 19). 

e Since the Hamiltonian mechanics is a variational subject that is based on the Euler-Lagrange equation 
(as seen above) it is subject to the variations of the Euler-Lagrange equation (as investigated in § 1.4- 
1.10) and hence the above formulation of the Hamiltonian mechanics (as represented by Eqs. 117 and 118) 
represents the common cases. For example, the Hamiltonian formulation may have only one dependent 
variable (see for instance part a of Problem 4; also see Problems 12 and 19) or it may have multiple 
independent variables (see Problem 19). In brief, the Hamiltonian mechanics is just an application and 
instantiation of the calculus of variations as represented by the Euler-Lagrange equation in its various 
forms and flavors (where the principle of least action plays the major role in establishing and justifying 
the physics) [444] 

e Our objective in the Problems of this chapter is to clarify the variational aspects of the Hamiltonian 
mechanics (as an application of the calculus of variations), and hence any other issue (such as the physics 
behind the individual Problems) is not of major concern or interest to us. Accordingly, the presentation 
and investigation of those other issues may be superficial or non-rigorous. 


Problems 


1. Discuss the following statement: “The Hamiltonian mechanics is based on Newton’s laws in its physical 
framework”. 
Answer: In general terms, the Newton’s laws formulation of mechanics and the Hamiltonian formula- 
tion of mechanics are independent but equivalent formulations and hence they may be seen as equally 
fundamental (noting that each one of these formulations can be derived from the other). Yes, Newton’s 
laws have historical precedence and hence the Hamiltonian mechanics can be seen from this perspective 
as originating from the Newtonian mechanics although the Hamiltonian mechanics may also be seen as 
more fundamental from other perspectives (as discussed in the literature and will be touched on later). 
In fact, there are some differences in opinion about which is more fundamental (assuming a precedence 
in some sense is presumed or established). Anyway, our opinion is that Newton’s laws are more funda- 
mental from a theoretical and conceptual perspective due to the fact that the Newtonian philosophy 
and paradigms (which Newton’s laws and Newtonian mechanics are based on) are at the foundation 
of the Hamiltonian rationale and formulation. On the other hand, the Hamiltonian mechanics can 


[109] We note that in most cases it is minimal but this is not necessarily the case. In fact, to be more accurate and general we 
should use “stationary” instead of “extremal” (but we followed what is common in the literature in stating this principle). 

[110] We should note that the given statement expresses the essence of this principle only (ignoring some details and conditions 
which are out of scope) and hence it is not sufficiently rigorous. We should also note that from this statement we can 
see that the action has the physical dimensions of energy times time. 

1111 Tn other words, the Euler-Lagrange equation provides the mathematics of this mechanics while Hamilton’s principle of 
least action provides the physics of this mechanics. 
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be seen as more fundamental (at least in some cases and situations) from a practical and procedural 
perspective. It may also be regarded as being more fundamental from its variational perspective since 
the principle of variation or optimization is one of the most fundamental physical principles due to 
the fact that many applications and branches of science are (or can be) established from the logic and 
rationale of optimization and variation. Also, see Problem 2. 

2. Make a brief comparison between Lagrange’s equations in mechanics (as embedded in the above- 

described formulation of the Hamiltonian mechanics) and Newton’s laws of motion. 

Answer: We note the following: 

e Lagrange’s equations are physically and logically equivalent to Newton’s laws and hence they can be 
seen as another form of Newton’s laws. 

e Lagrange’s equations are scalar equations while Newton’s laws are essentially vector equations. This 
may be seen as an advantage to Lagrange’s equations since dealing with scalar formulations is generally 
easier than dealing with vector formulations. 

e The difference in the previous point between the two formulations may lead to another difference 
(and advantage to Lagrange’s equations over Newton’s laws) related to the invariance of the two formu- 
lations where it is claimed that Newton’s laws in component form are not manifestly invariant across 
coordinate systems while Lagrange’s equations are invariant. 

e Being based on the paradigms of energy and variation (which are general paradigms that occur across 
various disciplines of physics), the formulation of Lagrange’s equations can be easily and naturally ex- 
tended to fields of physics other than mechanics (and classical mechanics in particular which is the 
birthplace of this formulation). This may be seen as another advantage to this formulation in compar- 
ison to Newton’s laws which are less general in application and usefulness. In fact, this advantage can 
lead to other advantages such as generalization and unification. 

e Being essentially a variational formulation based on the notion of action, the formulation of La- 
grange’s equations is more fit and natural for investigating conservation principles and symmetries 
in physical systems and the relations between the two. This should be seen as another important 
advantage to this formulation over the Newtonian formulation. 

e Noting that the principle of least (or stationary) action is restricted to conservative systems, Newton’s 
laws of motion may be seen as more general from this perspective. In fact, there are more limitations 
and restrictions on the Hamiltonian mechanics (and hence on Lagrange’s equations of mechanics).!!1? 
e Newton’s laws of motion are associated with (and based on) a certain philosophical and epistemo- 
logical framework (unlike the formulation of Lagrange’s equations which is more like a physical theory 
or a mathematical method of purely technical nature) and hence Newton’s laws have more significance 
and far-reaching consequences from this theoretical perspective (although this may be questioned). 

3. State (briefly and non-rigorously) the principle of least action of the Hamiltonian mechanics. 
Answer: In conservative mechanical systems, the particles (possibly within a continuum) follow tra- 
jectories that optimize the action integral J = [{ L dt where L is the Lagrangian defined as the difference 
between the kinetic and potential energies, i.e. LD = T—U. 

4. Find the Lagrangian of the following mechanical systems: 

(a) A yo-yo hanging from a ceiling and it is unwinding vertically downward (see Figure 63). 

(b) A system made of a particle of mass m, connected to another particle of mass mz by a flexible 
inextensible string of negligible mass where m1 is moving on a horizontal table while mz is dangling 
vertically with the string being passed through a tiny hole in the table (see Figure 64). Assume that 
gravity is the only acting force with no friction involved (neither between m1 and the table nor between 
the string and the table or the hole). 

Answer: 

(a) Assume that the mass of the yo-yo string is negligible and the mass of the yo-yo (i.e. its rotating 
part) is m and its moment of inertia is J. Let the plane of the ceiling represent the zero potential 
energy reference (see Figure 63). Now, if | is the length of the unwinded part of the yo-yo string 
then the potential energy of the system is U = —mgl. Regarding the kinetic energy, it consists of a 


(112] Ty fact, generalizations and extensions to the principle of least action and its application should lift some of these 
limitations and restrictions. 
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translational part: 


and a rotational part: 


1 1 2 415° 
Ty = stu = =1 (5) =51 
R 
where v is the translational speed of the yo-yo (i.e. its descending part), w is its angular speed, and 
R is the radius of its axle (assuming the thickness of the winded part of the string is negligible).!'!%! 
Accordingly, the Lagrangian of this mechanical system is: 


Sie: 2 leap BIE eee ol Pv. 
L (ttf) =T-U=T+T,-U = 5mP +51, -mgt= 5 (m+ a5) + mgl 


Note: unlike the Lagrangian of most Hamiltonian mechanics problems, the Lagrangian of this problem 
has only one dependent variable (i.e. /) thanks to the correlation between | and w which we exploited 
to simplify the formulation (as well as being essentially a 1D problem). 


Ceiling 


Figure 63: A schematic illustration of the yo-yo mechanical system. See part (a) of Problem 4 of § 6. 


(b) Let use a cylindrical coordinate system whose origin is at the hole, and its z axis is pointing 
vertically downward (see Figure 64). Also, let the length of the string be / and the zero potential energy 
reference be the table level. Accordingly, the coordinates of m, are (p,¢,0) and the coordinates of m2 
are (0,0, z). However, because the string is inextensible / is constant and hence p + z = 1 which leads 
to z =1-—p. Now, the potential energy of the system is the sum of the potential energies of its parts. 
However, because m1 remains at the zero potential energy level its potential energy is zero and hence 
the potential energy of the system is the potential energy of mz which is U = —magz = —mag (I — p). 
Regarding the kinetic energy of the system, it is the sum of the kinetic energies of its parts, that is: 


T = T14+T2 


2,1 2 
= smyvy + =M2Qv5 


2 2 


[113] Tf the thickness is not negligible then it should be included in R and this leads to more complicated formulation. 
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Figure 64: A schematic illustration of the setting of part (b) of Problem 4 of § 6. 


1 . 1 

— Pie (3? a ia p°d”) al 32 (F 
1 . 1 

ee 5m ("? ae p?d”) ik 5mop" 


(v1 has radial and azimuthal components) 


(z=1— p) 


Accordingly, the Lagrangian of this mechanical system is: 


- lt 63 Yes er eer 
L (t.0, d, P; b) = T — U — ae ("? + p?d?) + 5mab mMm2g (I p) 


5. Given that the kinetic energy T and the potential energy U of a mechanical system are given by: 


sie 5 ml 


show that T + U = constant. What this means? 


Answer: We have: 


—(T+U) 


and 


U=U(q,°:: In) 


where in the last line we used the reduced form of the Euler-Lagrange equation, i.e. the equivalent to 
Eq. 3 (or rather a summed version of it) noting that the Lagrangian L does not contain t. Accordingly, 
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T+U = Ewhere EF is a constant (= —C). We note that in line 6 we use the fact that q7 is independent 
of q; unless 2 = 7 plus the fact that U is independent of q;. 

Now, T + U is the total energy (i.e. kinetic plus potential) and hence the obtained result means that 
the energy of such a system is conserved.|!"41 

. Show that for a simple system made of a single particle of mass m in a conservative force field, Lagrange 
equations are equivalent to Newton’s second law. 

Answer: We have: 


1 1 
L (t, £1, £2, £3, £1, £2,43) = L(t,r,t) =T—U = a(t tog) —U = 5m lel U 


where r = (#1,22,2%3) is the position vector of the particle, f = (%1,%2,%3) is its velocity vector 


and U = U(a1,22,x%3) = U(r). Accordingly, the three Lagrangian equations are ge —_4 (4) =0 
(i = 1,2,3), that is: 
OU d 
BOD eee oe i= 1,2,3 
a ane (i ) 
OU 
Ly 
ave = me 
Ox; ~ : 
—VU mr 
F mit 


where in the last line we used the fact that the conservative force is the negative gradient of the 
potential energy. As we see, the last line is Newton’s second law (i.e. force F equals mass m times 
acceleration f). 

. Find the Hamiltonian formulation!'!5! of a simple mechanical system consisting of a free particle of 
mass m moving in a plane. Also, interpret the result. 

Answer: We use polar coordinates (p,¢) which are sufficient to describe the motion of this particle 
whose movement is restricted to a plane. Now, the particle is free and hence its potential energy U is 


zero.!16l Regarding its kinetic energy, it is T= 5m Ga + p°d?) due to the fact that the velocity has 


in general radial and azimuthal components. Hence, the Lagrangian of this system is: 


L(t,.6,0.6) =T—U = 5m (i? + 08?) 


Accordingly, we have two Euler-Lagrange equations (one equation for each coordinate): 


a (5) a: 
Op dt \ 06 
2 242 d 0 2 242 
A [bor snes] (f[bnestmre]) = 
pi? -“ (mp) = 0 
mpe—mp = 0 (120) 
AND 


[114] More technically, the total energy of a mechanical system in a conservative force field is constant along its trajectory 


in space-time. Alternatively, in a mechanical system whose Lagrangian is independent (explicitly) of time the particles 
follow trajectories along which the total energy (i.e. kinetic plus potential) of the system is conserved. 


[115] We mean by “Hamiltonian formulation” in this sort of Problems the Lagrangian L and the Euler-Lagrange equation(s). 
[116] Th fact, it is constant that can be set to zero due to the arbitrariness of the reference level. Anyway, this may affect the 


Lagrangian but not the Euler-Lagrange equations. 
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oh _ 4 (a) ~ 9 

0p dt \ dd 

glares] 2(B[beredere]) - 
sine’) = 
“(mp*s) = 0 (121) 


Regarding the interpretation of the result, Eq. 120 represents Newton’s second law (i.e. force in the 
radial direction equals mass times centripetal/centrifugal acceleration)|!17I while Eq. 121 represents 
the conservation of angular momentum (noting that Eq. 121 leads to mp?2¢ = constant). 

Note: the expression of the kinetic energy T that we used above can be easily obtained (using the 
equivalent Cartesian system) as follows: 


—— sme? 
1 
= a (#? + y) 
| 2 
= img (pcos ¢) | + [Fos] | 
1 : 2 
= mi [pcos é — pésind] + [ising + pdcoso] \ 
= 5m| i? cos? $ — 2ppdcos dsin d + p?¢? sin? ot 


p’ sin? 6+ 2ppd cos gsin @ + po? cos” | 


— n+) 
8. Find the Hamiltonian formulation of a simple mechanical system consisting of a single particle of mass 
m in the gravitational field of the Earth. 
Answer: If q1, g2, g3 stand for the Cartesian coordinates x, y, z of the particle and ®(z, y, z) represents 
the gravitational potential of the Earth (noting that this potential depends on the coordinates only as 
indicated by the notation) then we have: 


i 1 
L(t,2,y,z,£,9,2) =T-U= sm m® = 5m (t? + 9° + 2%) —m® 


Accordingly, we have three Euler-Lagrange equations (one equation for each coordinate): 


OL _@ OL 0 OL _@ OL _¢ OL _@ OL 
Ox a Oy Oy Oz Oz 
that is: 
moe d . O® d : moe d . 


These equations can be simplified to: 


(1171 This should not contradict the fact that the particle is free. 


6 


9. 


10. 


11. 
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Find the Hamiltonian formulation of a mechanical system consisting of a satellite of mass m orbiting 
the Earth. Use a normalized spherical coordinate system centered on the center of the Earth. 
Answer: In this Problem q1,q2,q3 stand for the spherical coordinates r,@,@ of the satellite and 
®(r, 0, ¢) represents the gravitational potential of the Earth (which solely depends on the coordinates). 
Now, in spherical coordinates the velocity is v = (7,r6,résin@) and hence v? = 7? + r262 + r2¢2 sin? 6 
while the gravitational potential is 6 = —C/r (with C being a positive constant).!'!8] So, the La- 
grangian of this system is: 
1 


L (t,7,9, od, 0, 6) =T U = 5m me = 5” (i? } r2 6? t rg sin? 6) + nae 
r 


Accordingly, we have three Euler-Lagrange equations (one equation for each coordinate): 


Oby OfOLN” .. i gate ao, ae h 

OL? ad FOL. -. 272 d (29) _ 

on tae (35) = 0 > r* 6° sin 6 cos 6 af (r 6) = 0 (123) 
OL d (OL\ _ d (25 29) _ 

er (5) 5% a 5 (r sin 6) =0 (124) 


Simplify the Hamiltonian formulation of Problem 9 by assuming that the orbit is in the equatorial 
plane of the coordinate system (i.e. the plane 0 = 7/2 that passes through the center of the Earth). 
Answer: In the equatorial plane @ = 7/2 and hence cos@ = 6 = 0 and sind = 1. Therefore, Eqs. 
122-124 reduce to: 


- _ 
rp — po 
0 = 0 
d / 4; 
(rs) = 0 
On discarding the second equation (which is trivial) we get two Euler-Lagrange equations: 
; C 
re—-—=-F = 0 (125) 
r 
d/o»; 
= = 0 126 
dt G 8) 28) 


Analyze the results of Problem 10. 
Answer: If we multiply Eq. 125 with m (which we discarded earlier for simplicity) and rearrange the 
terms we get: 

_ =m (rd? is i) (127) 
= 
where ae is the magnitude of the gravitational force while (re? —*f) is the magnitude of the radial 
acceleration and hence Eq. 125 is just Newton’s second law for the gravitational field (which is a 
central force field). 
Regarding Eq. 126, it can be easily integrated to obtain r2¢ = C, (with C, being a constant). Now, 
rg is the magnitude of the angular momentum per unit mass and hence Eq. 126 represents the 
conservation of angular momentum. In fact, rg also represents twice the areal speed and hence the 
equation r2¢ = C, represents Kepler’s second law. 
Note: we may put Eq. 127 in the following form: 


C . 
es +m = mr¢? 
= 


[118] In fact, C = GM where G is the gravitational constant and M is the mass of the Earth. 
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where the left hand side represents the total central force (i.e. gravitational plus inertial) while the right 
hand side represents mass times centripetal acceleration (and hence the equation represents Newton’s 
second law for central gravitational fields). 

Obtain the Hamiltonian formulation of a mechanical system consisting of a single particle of mass m 
executing a 1D simple harmonic motion. What type of solution this system has? 

Answer: For simple harmonic motion we have: 


L(t,2,4)=T -—U = mz? she” 


where t is time, x is the displacement of the particle from its equilibrium position, z is its speed 
(i.e. dx/dt) and k is a positive constant (i.e. the “spring” constant). Accordingly, we have a single 
Euler-Lagrange equation, that is: 


ab _d (at) _ , 
Ox dt\dz) — 
O [1.5 2 adfad ll. 1,5 
=| mx she?| a & [sm 5 he = 0 
d ae 
—ka — = (ma) = 0 
mitkxe = 0 


The solution is obviously a sinusoidal function of time (with frequency oes and appropriate mag- 
nitude and phase shift). 

Obtain the Hamiltonian formulation of a mechanical system consisting of three particles connected in 
series by three springs in-between (and the entire system is connected to a fixed support S). Assume 
that the springs are linear (Hookean) and massless and the only forces acting on the system are the 
spring forces (so the effect of other forces like gravity is negligible). 


53 


TZ 


Figure 65: A schematic illustration of a system consisting of three particles (of masses m1,m2,m3) 
connected in series to three springs $1, S2,53 and secured to a fixed support S. See Problem 13 of § 6. 


Answer: Let m1,™m2,m3 be the masses of the particles, 51,5253 the springs, k,,k2,k3 their spring 
constants, and 21, 2,73 the displacements of the particles from their equilibrium positions O;, O2, Os 
(see Figure 65). Now, my, is displaced from its equilibrium position O, by x; and hence its potential 
energy (which is due to the stretch/compression in S) is skirt. Regarding mg, it is displaced from 
its equilibrium position Og by x2 but part of this displacement is due to the displacement x, and 
hence its potential energy (which is due to the stretch/compression in $2) is $k2(x2 — x1). Similarly, 
mg is displaced from its equilibrium position O3 by x3 but part of this displacement is due to the 
displacement x2 and hence its potential energy (which is due to the stretch/compression in $3) is 
+k3(x3 — 22). So, the potential energy of the system is U = $k aj + $k2(x2 — 21)? + $k3(a3 — 2)”. 


Regarding the kinetic energy, each particle m; (¢ = 1,2,3) has a speed «; = dae 


which represents 
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the temporal rate of change of its position relative to its equilibrium position (noting that all the 
equilibrium positions are at rest). Therefore, the kinetic energy of each particle is Smjx? and hence 
the kinetic energy of the system is T = $m @7 + 4mo43 + $m3%3. So, the Lagrangian of the system 


1s: 


2 


1 
KS a) a) -2 2 
L(t, 21, £2, %3,%1,%2,%3) =T—-U= 5 [mia] + Mok + m3h3 — kay — ko(x2 — 21) 


Accordingly, we have three Euler-Lagrange equations (corresponding to x1, 12,43): 


OL d/foL : 

ace a & = 0 => kyay + ko(a2 — #1) — m1 #1 = 0 

OL d/foL cs 
Ona de (an) = = — ka (ag — 1) + kg («3 — 2) — mat = 0 
OL d/foL ‘. 


14. Obtain the Hamiltonian formulation of a simple pendulum consisting of a particle of mass m hanging 
at the end of an inextensible and weightless string of length / and swinging in a vertical plane where 
(uniform) gravity is the only active force (see Figure 66). Suggest a simple solution. 


(1 — cos 6) 
Figure 66: A schematic illustration of the simple pendulum system. See Problem 14 of § 6. 


Answer: For this system we have: 


L (1,0,6) =T —U = 5ml?6? — mgl (1 — cos 8) 
where t is time, 8 is the angle of displacement from equilibrium, 6 is the angular speed (ie. d0/dt), and 


g is the magnitude of the gravitational field (the so-called gravitational acceleration).!'19|_ Accordingly, 
we have a single Euler-Lagrange equation, that is: 
OL d/foL 
Sates 


060 «dt \ ae 


[119] The zero potential energy reference corresponds to the position 6 = 0, i.e. when m is below the ceiling by a distance I. 
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O [1 os, d [0 [1 o-5 _ 
0 gl" mgl (1 cos6)| 7 (= smi mgl (1 — cos @) = 0 
. d 29) _ 
—mgl sin 9 ~ = (mi 6) =: 40 
—mgl sin @ — ml?6 = 0 
l6+gsind = 0 


For small @ we can use the approximation sin@ ~ @ and hence the pendulum equation becomes 
16 + g9 = 0. So, the solution (according to this simplification) is a sinusoidal function of time (with 
frequency V4). 

Obtain the Hamiltonian formulation of a compound pendulum consisting of a particle of mass mj, 
(hanging at the end of an inextensible and weightless string of length 1) to which a second particle 
of mass mz (hanging at the end of an inextensible and weightless string of length Iz) is attached (see 
Figure 67). Again, the swing of the compound pendulum is restricted to a vertical plane and (uniform) 
gravity is the only active force. 

Answer: Let solve this Problem using the setting of Figure 67 where we use an inverted Cartesian 
coordinate system (i.e. its y axis is pointing downward) in the vertical plane. The origin of the 
coordinate system is taken at the hanging point of the string /, while the zero potential energy reference 
is taken at y = 0. In this setting 6, and 62 are the angles of displacement of m, and m2 from the 
vertical line. Now, the positions of m, and mz are: 


@ IL 


Figure 67: A schematic illustration of the compound pendulum system. See Problem 15 of § 6. 


(21,41) = (Ly sin 91, ly cos 0) and (x2, y2) = (11 sin 9; + ly sin 02, ly cos 6; + lo cos 02) 
Hence, the potential energy of the system is: 


U = —migyi — M2gy2 
= —mygly cos cal — m2g (11 cos 0, + Ip cos 02) 


(—migh — magl) cos Cal om mogle cos A 
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= —[gli (mi + mz) cos 01 + maglz cos 42] 


while its kinetic energy is: 


T 


l| 


I 


I 


1 2,1 2 
sMyvV7 + =M2Qv5 


2 2 


dL ; : 1 : F 
sy (2 +92) + 4m (83 + 69) 


1 : F 
5m (136? cos? 6; + 176? sin? 61) + 
1 : ‘ 2 e : 2 
gi” (Zz cos 6; + 1242 cos 6.| + 16, sin 9; — [969 sin 6s] ) 
1 : 1 : ee F 
sili + 5me (i cos” 6, + 214120162 cos 01 cos 02 + 1305 cos” 62 + 


126? sin? 0, + 21,1261 62 sin 0; sin 02 + 1363 sin” .) 


1 oe 
similgde + sms [126 + 1263 + 211126162 (cos 6; cos 8s + sind) sin 4.)| 


1 
2 
1 | : : Tod 
smal + Sma [1262 1262 211 126, 62 cos (Oi ~ 62)| 


3 (m4 + m3) 0; + ginal383 + Mol1 1201 02 COs (0; = 2) 


So, the Lagrangian of the system is: 


L (t,61,62,61,63) = T-U 


1 ; 1 ; a 
= 3 (m4 + mg) 126? + 5 irala02 + Mol, 120162 Cos (A, = 62) + 


gli (mz + m2) cos 01 + magly cos Og 


Accordingly, we have two Euler-Lagrange equations (one equation for each 6): 


Ok, 7 OLN: ... 
06, dt 06, a 
—mMal 126100 sin (1 a & 62) = gl (m4 i m2) sin 9; 
d : : 
rr [om + m2) 170, + Mal, leo COs (1 = 62)| = 
—mMal) 126100 sin (1 aa 62) =F gl (m4 oe m2) sin 9; 


a (my + mz) 126, — Maly 1o6> cos (0, ae 02) + Maly loo (41 = 6.) sin (0, mi 02) = 


—myl36> = Maly 1964 cos (0, A 02) + Maly 1961 (41 = 6.) sin (0, = 02) 


AND 
Ob ~ BPOEN: . 
06 dt 06> a 
mMaly12616> sin (1 = 62) = maglz sin A 
d F . 
ae [ral}b. + mglyl261 cos (01 _ 62)| = 
Malyl26162 sin (1 = 62) mat maglz sin A 


l 
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16. Obtain the Hamiltonian formulation of a spherical pendulum consisting of a particle of mass m hanging 
at the end of an inextensible and weightless string of length | and swinging around (in all directions 
but restricted by the length /) where the (uniform) gravity of the Earth is the only active force. 
Answer: We use a normalized spherical coordinate system centered on the fixed end of the string 
with the 6 = 0 axis (corresponding to the z axis) pointing toward the Earth center. In this Problem 
the dependent variables are the spherical coordinates r,@,@ of the particle m and we take the zero 
potential energy reference to be the plane passing through the origin of coordinates and parallel to 
the surface of the Earth. Now, in spherical coordinates the velocity is v = (7, r6,rdsin 6) but since m 
is always at distance | from the origin then r = / and * = 0 and hence the velocity of the particle is 


v = (0,10,lésin@). Accordingly, the kinetic energy of the particle is T = sm (76 + (2¢2 sin? 0) and 


its potential energy is U = —mglcos6@. Therefore, the Lagrangian of this mechanical system is: 
ons lee 1 2 ( p2 2 112 
L (t,r,0,6,#,6,6) =T-—U = 5ml (4 +¢@ sin 8) + mgl cos 


Accordingly, we have three Euler-Lagrange equations (one equation for each coordinate): 


ob _d (ab) _ 
Or dt \ Or a 
OO lh ah aa tote @ Bf O [Le eft i = 
ar 5m! (4 + p* sin 6) + mgl cos 6 — alae gm (4 + 9° sin 8) + mgl cos = 0 
0-0 = 0O 
0 = 0 
AND 
ob _d (ab) _ 
00 dt \. a0 a 
CM eee Cee eee @f(O [1 offs yo 2 = 
mala! (4 + o* sin 6) + mal cos 6 marae: gm (4 + 9° sin 8) + mgl cos = 0 
ee me . d/l 4/,; = 
gm (26 sin cos) ~ mgl sind — = nt (26) = 0 
ml?¢? sin 6 cos 0 — mgl sin é — m?6 = 0 
1¢? sin 0 cos 0 — g sind — 10 = 0 
AND 
ob _d (ab) _ 
d6 dt \d¢ 7 
6) 1 2 (f2 OS. u d 0 1 2 [ p2 b2 ain2 _ 
oa [5m (4 + @* sin 6) + magl cos 6 =a a6 gm (4 + 9° sin 8) + mgl cos = 0 
d tL epehc he. to =, 
0-5 (5m (26sin 6)}) a 0 
ml? (dsin? 6 + 246 sin 6 cos 6) = 0 
dsin?6+2d0sindcos@ = 0 


So, we have only two useful (non-trivial) equations. 

17. Use a Hamiltonian approach to analyze a free system made of two massive particles interacting by a 
conservative force that solely depends on their separation. 
Answer: If the masses of the particles are m; and m2 and their position vectors are r; and re, then 
the Lagrangian of the system is: 


<r 1 : 1 : 
L(t,r,%2,t1, 2) = Pi lt)? + ima lé2|? — U (x) 
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where r =r, —Yro= 


185 


(11, 2,23) is the separation vector. Now, let R = mary tmere = (X1, X2, X3) be 


the position vector of the center of mass of the system (with MW being the total mass, i.e. M =m +m). 


Accordingly: 
YT, = r+ Sor — Sars 
= wnt a? ae 
_ my, +™mMe m2. ews 
= M 1+ aet2— at? 
— ee eee we 
anes) Aaa aaa Mame Ys 
_ TA ‘fe m2. mo. Ma, 
Se Met ee ee = eS 
= mir, + Mere mg (r 12) 
Vi irae 2 
m2 
= R — 
+ Vag 
AND 
my 
r = —r,-—r,+r 
2 i ae 2 
= My My M 
= we Vin Me? 
a das des my, +™Me2 
~ Mt M* M 
_ Ot. aes an (te. m2 
Ne i eee Ve 
= tas eee te 4. Pa 
aN Ne ye 
Mi, + M2rg My, icc) 
uM uw 2 
=. aa oe 
M 
Hence, t=R+ arr and t. =F — +r. Therefore: 
2 
piP = (RMB) (Re MB =a ate Ep 
2 
an a : mi, my \ my My 1412 
lal" = (R- 774) (R- FF) = |B aap E+ ple 
Accordingly, the Lagrangian becomes: 
1 .|2 3 ; 
L = 5m (|i +272 Re + 73 si) sina (jf —2ER £4 5 Lief?) - U (r) 
1 . |? , imim 4 mami 
5m [Rf + Sarre lee + 5m |] +5 UG) 
1 + |? m m3 ae 
=} (rm +a) || 5( a el? U(r) 
1 . (2 mym2M 2 
= 5M [R] + ap ve) 
1 . |2 
= 5M[Rl +2 a? -Ue@) 
1 . MiM2 /. . ; 
= 5M(X?+ X24 X23) + SU (a2 + a9 + 49) - U (er, 92,25) 


2M 
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Now, from the perspective of the center of mass (where the Lagrangian is a function of the coordinates 
X1, X29, X3 and their derivatives as well as time), the Euler-Lagrange equations are: 


aL od ( AL 
me) ma ro 
d g 
0- © (uX;) = 20 
MX; = 0 
X; = 0 
R = 0 


This result is logical because the system as a whole (and hence its center of mass) is free of any force 
and therefore its acceleration should vanish according to Newton’s first law. 

From the perspective of the interacting particles (where the Lagrangian is a function of the coordinates 
X1,02,x3 and their derivatives as well as time), the Euler-Lagrange equations are: 


OL d/oOL 
salar) = 0 (i = 1,2,3) 
OU d myzmMs, , 
OU mime. 
(Ox; as M ea a 
mimg., _ OU 
oT ia ae 
mf = —VU 


where m, is the reduced mass and V is the gradient operator. This result is also logical because it 
represents Newton’s second law, i.e. mass times acceleration equals force (noting that the conservative 
force is the negative gradient of the potential energy). 

Show that a particle whose trajectory in a 3D Euclidean space is restricted to an equipotential surface 
follows a geodesic path. 

Answer: We describe the particle trajectory (which connects its start and end points) by Cartesian 
coordinates x, y, z. Now, since the trajectory is restricted to an equipotential surface then the potential 
energy of the particle is constant, that is U = C’. Hence the Lagrangian is: 


1 
L(t,2,y,2,£,y,2) =T—-U= 5m (#° ty +s)-C 


where m is the mass of the particle. Accordingly, we have three Euler-Lagrange equations (one equation 
for each coordinate): 


dL a (aL 

=~ als) au - — 
5b. dk (OE 

~-2(2) = re s 
ee Wee : 
alae) =e zs eae 


Now, let parameterize the Cartesian coordinates of the geodesic that connects the two end points of 
the trajectory with ¢t and hence x = x(t), y = y(t) and z = z(t). Accordingly, the geodesic is the path 
of optimal length s as given by: 


s= | as= [ Vlei + (an? + (aay =f" (Pee aa Teas 
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and therefore F = ,/4#? + y? + 22. Now, referring to footnote [12] F’ is independent of t,x,y,z and 
hence the Euler-Lagrange equations are « = constant, y = constant and z = constant, which lead to 
&=0, ¥ = 0 and % = 0. This means that the mathematical formulation of the two problems (i.e. 
motion over an equipotential surface and motion along a geodesic curve) is the same and hence the 
trajectory as found from the Hamiltonian formulation is identical to the trajectory as found from the 
geodesic formulation, i.e. the particle follows a geodesic path. 

Note: those who may not feel comfortable with the use of footnote [12] may use Eq. 3, that is: 


Oo ED) a) 5) +21 32 
ps =O =f = 
Or e+ P+ HP 
0./PL PLE “2, 32 
e+pt ey -~«4 => EE, 
OY e+ y2 + 22 
O/P +P 42 any: 
poegee a IE Sey => ee nF 
EE PL eee 


which lead (by adding and simplifying) to \/%? + y? + 2? = constant. Hence, from Eq. 4 we get: 


OF & Ci, OF _ y _oC OF = z 


which lead to « = constant, y = constant and z = constant, and hence % = 0, y = 0 and z= 0. 
Use a Hamiltonian approach to analyze the transverse oscillation of a taut string. 

Answer: Let the string in its equilibrium (non-oscillating) state be along the x axis. To simplify the 
analysis, we assume: 

e The oscillations are in the y direction only (and hence the string does not oscillate in the x and z 
directions).!!20l 

e The oscillations are small (and hence the displacement in the y direction is tiny). 
e The tension in the string is the only force involved (and hence forces like gravity and friction are 
negligible). 1221 

Before we go through our detailed analysis we should note that this Problem is different from the 
previous Problems of this chapter in two important aspects. First, it is a continuous system problem 
(not a discrete system problem) because string is a continuous object (like curve) and hence it cannot 
be represented as discrete particle(s). Second, in the following formulation we use two independent 
variables and one dependent variable (unlike the common Hamiltonian problems where we usually have 
a single independent variable and multiple dependent variables). 

Now, let 7 be the tension (force) in the string and yz its linear mass density. Also, let the displacement 
of the string in the y direction be a function of time t and position x, ie. y = y(t,x). The kinetic 
energy of the string is the sum of the kinetic energies of its parts and hence it is given by the following 
integral (where the integral plays the role of continuous sum):l'?] 


2 4 (Oy\* a, 
Gs eee ey ee 
LS (GE) wa [Gate 


C6 


[121] 


1120] fy other words, the oscillations are restricted to the zy plane where each particle of the string moves along a straight 


line parallel to the y axis. 


1121] More precisely, 2¥ < 1 for all a and at all times. However, this condition may be needed only for some restricted 


Ox 
models (e.g. for constant tension and uniform mass density). 


[122] Tn fact, non-conservative forces can be excluded by the restriction of the Hamiltonian approach (which we employ here) 


to conservative systems. 


[123] We note that the (continuum) expression Eye dz is no more than the usual (discrete) expression of the kinetic energy 


4mv? where pdx represents m while ye represents v?. So, T = he fur dx is the continuous version of the discrete 


version T = Soi, EMyv?. For a more technical approach, the reader is referred to the literature (see for instance 
Weinstock in the References). 
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where x; and 22 are the x coordinates of the end (fixed) points of the string and y, = Oy/Ot. Regarding 
the potential energy, we simply apply Hooke’s law (in its continuous version) and hence:!!?41 


2 7 (Ay \? os aae 
= _ —— — = d. 
U [ 5 ($4) dx [ 5 Ya dx 
where y, = Oy/Ox. Hence the Lagrangian is: 
a = Bas " dx = = 2 ya 
LE 2, ys yoyo) =T -—U= 5 Ut aX — 5 yr Wt = 5 (uy; — TyZ) v 
X11 zx 


xy 


Now, if we apply Eq. 117 to our Lagrangian (noting that we have a single dependent variable y) we 


get: 
te te 1 v2 ‘ ; 1 t2 x2 7 9 
I [y] =| Ldt =| 5 (uy; — Ty) dx| dt = > / (uy; —Ty3) dx dt 
ty ty 2 Ly 2 ty ry 


As we see, this is a functional integral (with F = py? — ry2) of a variational problem with multiple 
independent variables (see Eq. 14 in § 1.6) and hence we use Eq. 15 (noting that t,2,y, yz, yz in our 
case correspond to 21,22, Y, Yr, Yr, in Eqs. 14 and 15), that is: 


OF 9 (OF) 9 (aF\ _ , 
Oy Ot \ON Ox \ OYx 
0 2 2 0 0 2 2 0 0 2 2 a 
Dy [eye — TYZ| Al ( Dyn [eye — TYZ] aa \ Bys ey; —Ty2]) = 0 
0 fo) 
0- a (2uy2) Da | 2Tyx) = 0 


0 rs) 
yi (uyt) — Ae (Tyz) = 0 


Now, if ys and 7 are independent of ¢t and x then the last equation becomes (noting the meaning of the 
partial derivatives with respect to the independent variables as explained in § 1.6): 


wa w)— To (ve) = 0 
O [Oy O [Oy _ 
wx (3) "Ox (54) = 
O7y O7y 
rae pee 


which is the 1D wave equation for small transverse oscillations on a taut string of uniform linear density 
and constant tension (with wave speed vy = \/T/,1). 


[124] Instead of going through detailed analysis (which is time consuming and distracting from our variational objective) we 
can justify the above integral as follows (using slack notations and deliberations): 


72 7 (Oy 2 72 7 Oy Oy T2417 dx 21 7 2 
U= dx = dx = OyO = (e) 
a 2 (3) a . 2 Ox Ox ¥ [ 2 Ox z Van is 3 da | ¥) 


As we see, oe corresponds to k and (Oy)? corresponds to x? in the expression of the potential energy of the common form 
of Hooke’s law (i.e. U = tka). So, U= te Fy? dz is the continuous version of the discrete version U = S7j_, dkia?. 


For a more technical approach, the reader is referred to the literature (see for instance Weinstock in the References). 


Chapter 7 
Sturm-Liouville Problems 


The Sturm-Liouville problems are defined by the following differential equation (on a given interval a < 
zg < b);125] 

= < (py') + ay = Awy (128) 

Be 

where p,q, w,y are functions of x (with p and w not vanishing on the interval), is an eigenvalue of the 
eigenfunction y, and y’ = dy/dz. It can be easily shown that Sturm-Liouville problems can be formulated 
and solved as variational problems. In fact, the Sturm-Liouville differential equation is no more than the 
Euler-Lagrange equation for a certain type of variational problems with constraint (see Problem 3). This 
establishes a relationship between the eigenvalue problems and the calculus of variations (see Problem 7). 


Problems 


1. Write the Sturm-Liouville equations for the following sets of p, q, w: 
(a) p=1,q=a2,w=1. 


(b) p=a, q=e*, w=2° (where a and 0 are constants). 
(c) p=—2,q=2,w=2°. 
Answer: 


(a) Inserting p,q, w into Eq. 128 we get: 


—£ (ly) +ay=Aly that is —y" +ary=ry 
(b) Inserting p,q, w into Eq. 128 we get: 
_ (ay’) + e®y = Any that is — ay” +e"y = day 
(c) Inserting p,q, w into Eq. 128 we get: 
-= (—axy’) + ry = Ax8y that is sy +y +ay = dAx*y 


2. What are the p,q,w for the following Sturm-Liouville equations: 
(a) y” — 5a3y = —2Azy. 
(b) x?y” + 2ay’ + yln |x| = —aAy (where a is a constant). 
(c) e?*y"” + 2e%ty’ — ¥ = ABH 


8 
(d) y” — zy t+ Ax?y = 0. 
Answer: 
(a) Putting the equation in the standard form of Sturm-Liouville equation (Eq. 128) we get: 
d 
— ay (oy) + (—52*) y = (20) y 


By comparing this to Eq. 128 we see: p = —1, q = —5a?, and w = —2z. 
(b) Putting the equation in the standard form of Sturm-Liouville equation (Eq. 128) we get: 


© (0°y/) + yln|a = —ary 


1125] The reader should note that Sturm-Liouville equation may be given in other forms which in most. cases differ from this 
form by the sign of some terms and possibly minor differences in notation and convention. 
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By comparing this to Eq. 128 we see: p = —2?, q = In|z|, and w = —a. 
(c) Putting the equation in the standard form of Sturm-Liouville equation (Eq. 128) we get: 


y _ Ady 
ec) 


_ (c??y" ait 2c?" y') 4 (=) y 


d 
a (ery) Me 


—e2%y! =~ 267" y/' 4 


| 

»- 
a™~ 
8, on 
SY 

< 


—~ 
Sie 
Ns 

Kad 

II 

- 

Zona 
Blan 
Sy 

K—g 


By comparing this to Eq. 128 we see: p = e?”, g = 1/x, and w =5/z°. 
(d) Putting the equation in the standard form of Sturm-Liouville equation (Eq. 128) we get: 


yay = —Ar*y 
——(-y')+(-2)y = A(-2*)y 


By comparing this to Eq. 128 we see: p= —1, gq = —2, and w = —2?. 


3. Show that the Sturm-Liouville problem (as given by Eq. 128) is equivalent to the constrained varia- 
tional problem: I{y] = Iy[y] — AZo[y] where:!!?6 


b b 
Ane i. (py? +ay?) dv and ly = if wy? de (129) 


Answer: According to the formulation of constrained variational problems (see § 1.8) we have: 


b b b 
Ty] = Llyl]-Ably] = 1 (py’? + ay") ich -n 1 wy? is} =| (py’* + ay? — Awy”) da (130) 


Hence, H = F — \G = py’? + qy? — Awy? and the Euler-Lagrange equation (see Eq. 22) is: 


0 2 2 2 d 0 2, 2 2 _ 
By Wey + qy* — Awy?| om ay PY + qy?—Awy*]} = 0 
d 
2qy — 2Awy — = (2py') = 0 
d / 
—2— (py!) + 2ay = 2dwy 
Geto 
— Gy (Py) + ay = wy 


which is the Sturm-Liouville equation (Eq. 128). 

Note: a consequence of the result of this Problem is that Sturm-Liouville problems can be treated and 
formulated as constrained variational problems (see Problem 4), and some (but not all)!!?7! constrained 
variational problems can be treated and formulated as Sturm-Liouville problems (see Problem 5). 


[126] The reader should note that the eigenvalue of the Sturm-Liouville problem is represented by the Lagrange multiplier 
of the variational problem. We should also note that the minus sign in the constrained formulation is allowed because 
the sign of » is rather arbitrary. The use of the minus (instead of plus) sign is to keep the given form of Sturm-Liouville 
equation. 

[127] This restriction is due to the fact that JI, and Ig in Eq. 129 are restricted to certain forms. 
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4. Find the functional integrals J for the Sturm-Liouville equations of Problem 2. 
Answer: From Eq. 130 we have I = it (py? + qy? — Awy”) dx. So, from the results of Problem 2 
we have: 
(a) p=—1, q= —5z, and w = —2z. Hence: 


b 
I= / (—y? — 5a®y? + Avy?) dx 
(b) p= —2?, q=In|z|, and w = —a. Hence: 
b 
I =| (—2?y’? + y? In |x| + Aay?) dx 


(c) p=e?*, q=1/z, and w = 5/x?. Hence: 


1 5 
fof (ery? +?) dx 
7 x x 


(d) p=—1, q=—2, and w = —z?. Hence: 


b 
I =i (—y’? — ay? + Ax? y?) dx 


5. Find the Euler-Lagrange equation for the constrained variational problem of part (a) of Problem 5 of 
§ 1.8 and the constrained variational problem of part (b) of Problem 7 of § 1.8 without applying Eq. 
22. 

Answer: As we saw in Problem 3, H of a constrained variational problem (of a certain form whose 
I, and Ig are given by Eq. 129) is given by H = py” + qy? — Awy? where p,q, w are the parameters of 
the Sturm-Liouville equation that corresponds to the constrained variational problem (and hence the 
Sturm-Liouville equation represents the Euler-Lagrange equation of the given constrained variational 
problem). 

Now, for part (a) of Problem 5 of § 1.8 we have H = py’? + qy? — Awy? = xy” + \x?y? and hence the 
Euler-Lagrange equation for this constrained variational problem is the Sturm-Liouville equation with 
p=2,q—=0and w = —2”, that is: 


d / 
cee 0 = —)a? 
ag ey) + xy 
—ay"” —y! = —\x?y 
xy” +y! — A\x?y = 0 


which is what we found in part (a) of Problem 5 of § 1.8 by applying Eq. 22. 

Similarly, for part (b) of Problem 7 of § 1.8 we have H = py” + qy? — \wy? = y? — Ay? and hence 
the Euler-Lagrange equation for this constrained variational problem is the Sturm-Liouville equation 
with p= 1, q=0 and w = 1, that is: 


d 
-=(y')+0 = A 
y’+aAy = 0 


which is what we found in part (b) of Problem 7 of § 1.8 by applying Eq. 22. 
6. Using the Sturm-Liouville equation (Eq. 128), confirm that 4 = I) /I. 
Answer: Multiplying Eq. 128 with y, we get: 


d , 2 Ty 2 
— YG, (py) + ay = wy 
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b b b 
d 
-| I (py’) dx +/ q’?dx = Awy” dx (integrating) 
‘ b b b 
—[ypy'], + / py” dx + i q’ dx = Awy? dx (integration by parts) 
b b b 
j py”? dx + q’ dx = Awy” dx (using suitable boundary conditions) 
b b 
is (py? +qy?) dx = Af wy? dx 
Ih = Xp (see Eq. 129) 
hy de a 
Ip 


Note: in the integration by parts formula [udu = wv — f udu we use u = y and v = py’. 

. Referring to the formulation of Problem 3 and assuming normalization such that I2[y] = 1,128! show 

that the stationary values of I;[y] of the variational problem produce the eigenvalues of the Sturm- 

Liouville problem. 

Answer: From the result of Problem 6 and the assumption J2[y] = 1 we have:!!?9 
qh 

r — i = I 

Accordingly, the stationary values of J, of the variational problem produce the eigenvalues (X’s) of the 

Sturm-Liouville problem, as required. 

. Referring to the formulation of Problem 3, show that obtaining the stationary (or extreme) values of 

I is equivalent to obtaining the stationary (or extreme) values of J; /I9.|!°° 

Answer: Noting that variation follows the pattern of differential (and hence the rules of differentiation 

apply in its manipulation), we have: 


f, 61> Ak 
s(f) = oye 
bli. Eifi5) 
oe ge tne 
6h, — (y/Ip) 612 
Ip 

_ 6h, — rb 
gan 
_ OI 
> 


where in line 4 we use I, /Ig = X (see Problem 6). Now, since I, is constant (see § 1.8) the last equation 
means that stationarizing (or extremizing) I is equivalent to stationarizing (or extremizing) I /Iz. 

. Discuss and analyze the result of Problem 8. 

Answer: Since stationarizing I is equivalent to stationarizing I,/Iz (= ) then any function y that 
stationarizes I should stationarize I, /Iz (and vice versa). Now, because the functions that stationarize 
I are solutions of the Sturm-Liouville equation (see Problem 3) then this means that obtaining the 
functions that stationarize I,/Ip is equivalent to obtaining the functions that are solutions to the 


[128] We remind the reader that according to the formulation of constrained variational problems (see § 1.8) Iz is constant. 


[129] The reader should be careful in reading and interpreting this equation and its alike. This is due mainly to the (rather 
loose) use of J; where sometimes it stands for the value and sometimes for the stationary value. 


[130] Although this seems trivial (considering that Ij[y] is constant), the purpose is to show this technically (with the 
assumption J2[y] = constant being used only in the final stage). 
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Sturm-Liouville equation. In fact, this should facilitate the solution of (certain types of) differential 
equations!!34] by the methods of variational calculus (as well as benefiting from the techniques of solving 
differential equations in variational calculus). This will also have implications and consequences on the 
estimation and evaluation of the eigenvalues and eigenfunctions of the Sturm-Liouville equations (as 
will be demonstrated in the upcoming Problems). 

10. Using the results of the previous Problems, try to propose a method for estimating the eigenvalues 
and eigenfunctions (and obtaining bounds and approximations) of the Sturm-Liouville problems. 
Answer:!!87]_ Based on the results of the previous Problems, the stationary values of I,/Iz of a 
(restricted) variational problem (that complies with the above formulations and conditions) are the 
eigenvalues of the corresponding Sturm-Liouville problem. This means that the values of I,/Ig for a 
given problem should lie between the minimum eigenvalue \,,, and the maximum eigenvalue A jy of the 
Sturm-Liouville equation, that is:!!%*I 


I 
Nyy Se Se Nig 
Ip 


Accordingly, an estimation of I,/Ig for a given problem will provide an upper bound on A, and a 
lower bound on yy without going through any variational process or optimization procedure.!!34l 
Moreover, a (guessed) trial function y, that is used in this estimation could provide an approximation 
to the stationarizing (ie. true) eigenfunction y that corresponds to these eigenvalues (i.e. A, and 
Am) where the best of the trial functions usually corresponds to the best of the estimated eigenvalues. 
These issues will be clarified in Problems 11 and 12. 

11. Find an upper bound on the lowest eigenvalue of the following Sturm-Liouville problem: 


y” +Ay=0 ~~ with boundary conditions y(0) = 0, y(1) = 1 and y’(1) =0 


using the following trial functions (which all satisfy the given boundary conditions as they should 


be): 4351 
(a) y% = 24+ 23 — 6x? + 5x (b) y% = 2° — 32? + 3a (c) y= a +2242 
(d) ye = 2x — 2? (e) yw = a4 — 3a? +2? + 5a. 


Answer: The equation y” + Ay = 0 can be put in the following form: -4 (y’) = Ay. Comparing this 


form to the standard form of Sturm-Liouville equation (as given by Eq. 128) we have: 
Accordingly (see Eq. 129): 


I 


1 1 
i i (py? + qy”) dx = i y” dx (131) 
0 0 


1 1 
Ip = [ wrac= f y? daz (132) 
0 0 


a) For the trial function y, = «* + 2° — 6x? + 5a we have: 
y 


1 1 
Diy | (40° + 3a? — 122 +5)° de = i. (162° + 242° — 8724 — 32° + 1740? — 1202 + 25) dx 
0 0 


[131] Noting that p,q,w in the Sturm-Liouville equation are generic and can fit a wide range of functions, the differential 
equations of Sturm-Liouville type are not as restricted as might be thought. 

[132] We should note that other conditions and restrictions are required to make the explanations and arguments in this 
answer more rigorous. The details should be sought in more specialized texts on Sturm-Liouville problems and their 
relation to variational problems and variational calculus. 

[133] Again, the reader should be careful in reading and interpreting this equation and its alike. This is due mainly to 
the (rather loose) use of I; /I2 where sometimes it stands for the stationary value (corresponding to the stationar- 
izing/extremizing function) and sometimes for the value (corresponding to an arbitrary function which is usually an 
approximation to the stationarizing/extremizing function). We should also note that there are certain restrictions on 
Am and Ax for the validity of this equation (the details should be sought in more specialized texts). 

[134] Ty fact, with certain conditions and insights the values of I; /I2 may also provide approximations to the corresponding 
eigenvalues (as will be clarified later). 

1135] The subscript t in yz in this Problem stands for “trial” and does not symbolize partial derivative with respect to t. 
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16 87 Pen HBG 
— ee “ ; x — 8a* + 58x? — 602? 4 250] Sar: 
1 1. 
In = yi c+ +23 — 62? + 5a)” dx = f (a® + 2x7 — 112° — 22° + 462+ — 602? + 2527) dx 
0 0 
Tse, iy Lg. 46-4 25 3] 1247 
= L524 + = = 
"+ ae a a 1260 


Hence, an upper bound on the lowest eigenvalue is I, /Iz = 4896/1247 ~ 3.9262. 
(b) For the trial function y, = x? — 3x? + 3x we have: 


1 1 
: oy 9 
= | a” — 9x4 + 1823 — 182? 4 on == 
5 0 O° 
1 iL. 
Ih = | (a? — 3a? + 3a)” a= | (a° — 62° + 15x* — 180° + 927) dx 
0 0 


1 9 | 8G 
Ea x® + 3a? 5st t in| = id 


I 


Hence, an upper bound on the lowest eigenvalue is I, /Iz = 14/5 = 2.8. 


(c) For the trial function y, = —a? + 2? + & we have: 
i 1 
Li, = f (—3a? + 22 + 1) ax = f (9a* — 120° — 2a? + da +1) dex 
0 
1 
17 
= =a =a + 2x? 4 2| = 5 
1 1 
In = [i —x +2 ag)" ae = f (c° — 20° — a+ + 227 + 2”) dex 
0 0 
oe hod dd al Lal 
7 Ear SP a a ee 


Hence, an upper bound on the lowest eigenvalue is I, /Ig = 238/93 ~ 2.5591. 


(d) For the trial function y, = 2x — x? we have: 
: : 4 ,]' 4 
er | 2-20)? dr= | (4-80 +407) dx = |4r — 407 + -2°| == 
0 0 3 Jo 3 
1 1 571 
4 
In = ¢ (20 — 2”)? ac =f (4a? — 4x + «*) dx = |-2? — 244 . = 
‘ ‘ 3 5] 15 
Hence, an upper bound on the lowest eigenvalue is I, /Ig = 5/2 = 2.5. 
(e) For the trial function y, = x — 323 + x? + 32 we have: 
a are ae aN : 2 gs SOO: eee 9 
o= Ax x + 2x 4 dz = 16x 60x” 4 x 18x xu + 6x 4 dx 
‘ 2 2 " 4 2 4 
16:5 DROS Of BT is wy OLN |, OFF 
= |227-10 + 30 4 - 
Fa oe Oi OF ge AL O10 
1 2 1 
5 3 33 13 9 
In = i (=*- =g3 +47 + 5°) ax = f G bai + 6 _ 948 x* +303 4 o*) dx 
2 2 ' 4 2 4 


7 
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Beck 51989 


337 16 _ 13s onl. 
4” |, 2520 


5 8 
= x is x x 
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28 3 10 


3 
baa 4 


Hence, an upper bound on the lowest eigenvalue is I, /Ig = 3324/1339 ~ 2.4825. 

Discuss the results of Problem 11. 

Answer: Let first obtain the exact solution of the given Sturm-Liouville problem. In fact, this problem 
was solved in part (b) of Problem 7 of § 1.8 as a variational problem with constraint [but without 
the boundary condition y(1) = 1]. Now, if \ < 0 then (according to Problem 7 of § 1.8) the solution 
is y = 0 which does not satisfy the boundary condition y(1) = 1 (which we imposed in Problem 11). 
Therefore, we should have \ > 0 and hence the solution is y = bsin (3) which when combined with 
the boundary condition y(1) = 1 yields b = 1. So, the exact solution (eigenfunction) of the given 
Sturm-Liouville problem is y = sin (3) with an eigenvalue \ = 1?/4 ~ 2.4674 (as can be checked by 
substitution in the equation y” + Ay = 0 and verifying the given boundary conditions). 

Now, if we plot this solution (see Figure 68) beside the trial functions (which can be seen as approximate 
solutions) of Problem 11 we can see that as the trial functions become closer and closer to the exact 
solution (as we move from a to e) the approximations to the eigenvalue become closer and closer 
to the exact eigenvalue and hence the best estimation of the eigenvalue is obtained from the best 


approximation of the eigenfunction. 
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Figure 68: Plot of the exact solution y = sin (3) of Problem 12 of § 7 alongside the trial functions of 


2 


Problem 11 of § 7. The curve of the exact solution is solid thick while the curves of the trial functions are 
labeled with a, b, c, d, e (according to their labels in Problem 11 of § 7) with the curve e being dashed 
for clear distinction. 


13. 


Verify the result of Problem 6 by showing that using the exact solution (i.e. eigenfunction) of Problem 


12 as a trial function will produce the (exact) eigenvalue. 
Answer: The exact solution is y = sin (3) and its derivative is y’ = 5 cos (3). Hence (see Eqs. 131 
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and 132): 


Therefore, I, /Iz = 1?/4 which is the (exact) eigenvalue 4.!134 


[136] In fact, this shows that the stationary values of I, /I2 of a (restricted) variational problem are the eigenvalues of the 
corresponding Sturm-Liouville problem (see Problem 10). It should be obvious that the “stationary value” means the 
value obtained from using the stationarizing (or extremizing) function. 


Chapter 8 
Rayleigh-Ritz Method 


This is an approximation technique for finding the extremizing function in variational problems. The 
method can be instigated and applied numerically as well as analytically. Thanks to its flexibility, relative 
simplicity and natural adaptability to numerical implementations it is widely used in science and engineer- 
ing. The method is based on starting from a guess representing a generic form of the extremizing function 
where this guess can be regarded as an approximation to the real (or exact) extremizing function which 
is the sought solution to the variational problem. The generic form is then developed and determined by 
a variational procedure (possibly in a gradual process as will be clarified later in the following remarks 
and in some of the upcoming Problems). 

To put it in more practical terms, suppose that we are looking for an extremizing function y = y(z) of 
an integral I[y] representing the functional of a variational problem. Also assume that the function y can 
be approximated by a linear combination of linearly independent basis functions ¢; = ¢;(a) (¢ = 0,--- , 7) 
of a certain type (e.g. polynomial or sinusoidal) and hence we can write: 


y= got > CiPi (133) 
i=1 


where c; are constants to be determined during the variational procedure.!'37! So, all we need to determine 
y (or rather its approximation) is to determine the constants c;’s since the basis functions are known. This 
means that we simply converted (or rather reduced) our task from determining y itself to determining 
a set of constants assuming that the general form of y is known (as determined by the chosen type of 
the basis functions). In fact, our functional integral I[y] can now be written (rather more appropriately 
although we will not do that) as I[c,,--- ,c;] because this functional is now dependent (in its variation 
and optimization) on the constants c;’s. In other words, this functional is extremized (or stationarized) 
by the set of c;’s and hence we can tackle the variational problem by seeking the solution of the system 
of equations: 
ol 
Oc; = 

In fact, the best way to understand the rationale and procedure of the Rayleigh-Ritz method is to use 
it in some practical problems to understand and appreciate how and why it works. However, before that 
it is important to note the following points about this method: 
(a) Although the guessed form (as determined by the choice of the type of the basis functions) is rather 
arbitrary, this form should be chosen to be as close as possible to the real form (assuming that we have 
an idea about the real form) so that we can get the best approximation to the real solution. In fact, if the 
form of the real solution is known then the guessed form should be chosen to match the real form so that 
the obtained approximation will be very close (and potentially identical) to the real solution. For example, 
if we know that y is a sinusoidal function then we should choose our basis functions to be sinusoidal so 
that we obtain better results. 
(b) The Rayleigh-Ritz method as described above can be generalized and extended to include other 
variations and flavors such as having multiple variables. So, in this regard it is like the variational 
treatment in its analytical form (as represented by the Euler-Lagrange equation in its various variations 
and flavors which we investigated in the sections of chapter 1). 


0 (i =1,--+ ,n) (134) 


1137] The choice of cg to be 1 does not affect the generality since we can always divide by co 4 0 (or absorb non-unity factor 
into ¢9) to reduce the form of y to the above form. In fact, this is related to the implementation of the boundary 
conditions as will be clarified later. 
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(c) The Rayleigh-Ritz method can be used for estimating the eigenvalues in Sturm-Liouville problems. 
(d) Noting that the Rayleigh-Ritz method is used in boundary value problems, the zeroth basis function 
¢o is usually chosen to satisfy the given boundary conditions while all the other basis functions ¢; (i = 
1,--- ,n) are chosen to vanish at the boundaries.|"3*! 

(e) As indicated above, the determination of the form of the extremizing function may be done in a gradual 
process by starting from a certain order of approximation and moving to higher orders of approximation (if 
necessary) where this process stops when we get the required accuracy from the most recent approximation 
(according to certain criteria). For example, we may start from first order approximation y ~ y. = 
oo + c1¢, where we need only to determine c,. If, we are happy with y; then we stop the process; 
otherwise we go to the second order approximation y ~ yz = ¢o0 + c1¢1 + co¢2 where cz is estimated 
with re-estimation of cj. This may be followed by other approximations where in each approximation 
(say the i” approximation) the value of the constant c; is estimated while the values of c,,--- ,¢—1 
are re-estimated. The main issue (and the fundamental presumption) in this gradual process is that each 
approximation is better than (or at least not inferior to) the previous approximation so that we are always 
heading toward better approximations to the real solution hoping that in the end (i.e. if we continue with 
this process) we get very close to the solution y (or we may even get to the solution itself if we are lucky 
and made good choices). In fact, if this procedure develops as described above (i.e. the approximations 
improve persistently) then we should expect that we can make our approximation as close as we wish 
to the real solution by increasing the order of approximation (as represented by n) and hence we should 
expect to converge to the real solution when n — oo, that is:!!89 


y=oor+ ye Ci Pi (135) 
i=1 


(f) The above-described Rayleigh-Ritz method is one dimensional. The method can be easily generalized 
to multi-dimensions (although the required algebra and mathematical manipulation become very lengthy 
and messy). However, instead of going through the description of this simple generalization we will 
demonstrate this generalization by some examples of the Rayleigh-Ritz method in 2D (see Problems 6-9). 


Problems 


1. Describe the Rayleigh-Ritz method in a few words. 

Answer: It is a variational method that employs basis functions to find approximate solutions for 
boundary-value variational problems. 

2. Re-solve part (a) of Problem 12 of § 1.4 (with k = 1) using this time the Rayleigh-Ritz method in a 

gradual process. Plot the obtained approximation in the end of each stage of approximation (starting 
from y,) and hence stop this gradual process when the obtained solution is sufficiently close (visually) 
to the analytical (exact) solution that you obtained in Problem 12 of § 1.4. 
Answer: Noting that k = 1, we have I[y] = f° (y? + y?) da with y (#1 =0) =0 and y (#2 = 1) =1. 
We assume that we have no idea about the general form of the real solution and hence we use polynomial 
basis functions. Referring to point (d) in the text, we select the zeroth basis function ¢o to satisfy 
the given boundary conditions and select all the other basis functions ¢; (¢ = 1,--- ,n) to vanish at 
the boundaries. So, if we choose ¢9 = x (which is the straight line passing through the two boundary 
points) then the boundary conditions are satisfied by this function because ¢9(0) = 0 and ¢o(1) = 1. 
Also, if we choose the other basis functions so that they all contain the factor x(a — 1) then they will 
all vanish at the boundaries because the x factor will ensure the vanishing at x = 0 while the (a — 1) 
factor will ensure the vanishing at 2 = 1. Accordingly, we can write the n‘” approximation y,, (ie. the 
approximation obtained in the n“” stage of the gradual process) as: 


Yn =L+u(x—1) [er ter t-++ + ene 


[138] 1, may be more appropriate to say: while the other basis functions are chosen so that all the terms involving these 
functions vanish at the boundaries. This depends on the meaning of “basis functions”. Anyway, this is a trivial matter. 
[139] Tn fact, there are other conditions and restrictions and hence the above description is not sufficiently rigorous. 


8 RAYLEIGH-RITZ METHOD 199 


As we see, this approximation obviously satisfies the two boundary conditions at any stage of this 
process. 
Now, the first approximation is: 


yy =aet+qa(z—1l)=cq2?+(1—-ca)2 (136) 


On substituting this into the integrand of the functional integral we get: 


yr ty ~ Paet+(1— c))? + [era? +(1-—c) ae 


= 42x? +4e, (1-1) a+ (1-1)? +224 
= Acta? 4c, 2 Acta t1—2c +e ; 
= @et+ (2c — 2ci) x 4 (5c} 2c; 4 1) x 4 (4c Act) x (4 2c; + 1) 


2 
+ (1—¢)*2? 
2c8n? +27 — 207 +42 


2c1 (1 — cr) x? 
3 


tC t 
2 oe +22 


Cc 


On substituting this into the functional integral and integrating we get: 


1 


: 2c, — 2c? Bef — 2 1 Ac, — 4c? 
Py Ve) as Reals gia) a + (cf —2c, +1) a 
5 4 3 2 6 
2 2 2 2 
Ct 2c, — 2c{ dey — 2c, + 1 4c, — 4c} 9 
= 2c, +1 0 
E oF ( 1 ) ( 3 + 9 (4 Cy ) 
ll, 1 2 4 
= Re = 
3000 68 
On differentiating I with respect to c; and setting the result to zero we get: 
dl il 1 5 
ae i 5 ol r 0 and hence Cy 55 


5 8 | 5 ee eee 
Yy=—2 (i aye A lee eee 
On plotting this solution (plotted as circles in Figure 69) alongside the analytical solution (plotted as 
solid curve in Figure 69) we see that it is sufficiently close. So, we stop this gradual process. 

. Re-solve part (g) of Problem 12 of § 1.4 using this time the Rayleigh-Ritz method in a gradual process. 
Plot the obtained approximation in the end of each stage of approximation (starting from y;) and 
hence stop this gradual process when the obtained solution is practically indistinguishable (by vision) 
from the analytical solution that you obtained in Problem 12 of § 1.4. 

Answer: We have I[y] = f° (y/? — y? — 2ay) dx with y (x; =0) = 1 and y (#2 = 1) = 2. We assume 
that we have no idea about the general form of the real solution and hence we use polynomial basis 
functions. Referring to point (d) in the text, we select the zeroth basis function ¢o to satisfy the 
given boundary conditions and select all the other basis functions ¢; (¢ = 1,--- ,n) to vanish at the 
boundaries. So, if we choose ¢9 = 1+ x (which is the straight line passing through the two boundary 
points) then the boundary conditions are satisfied by this function because ¢9(0) = 1+0 = 1 and 
go(1) = 1+1 = 2. Also, if we choose the other basis functions so that they all contain the factor 
x(a — 1) then they will all vanish at the boundaries because the «x factor will ensure the vanishing at 
x = 0 while the (a — 1) factor will ensure the vanishing at x = 1. Accordingly, we can write the n‘” 
approximation yp, (i.e. the approximation obtained in the n*” stage of the gradual process) as: 


Yn =14+ata(e@—-1) [ep tert-:-+en2"""] 


As we see, this approximation obviously satisfies the two boundary conditions at any stage of this 
process. 
Now, the first approximation is: 


yy =l+tetae(e—1)=c27+(1—c)e+1 (137) 
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1 T T T 


— Analytical 
(e) yy 


Xx 


Figure 69: Plot of the analytical solution y = eee of Problem 2 of § 8 (as obtained in part a of Problem 


12 of § 1.4 with & = 1) alongside the first Rayleigh-Ritz approximation y, = ae + sau. 


On substituting this into the integrand of the functional integral we get: 


y? —y*-2ay ~ PRae+(1—c)]* [ex + (1 c)@+1]* — 22 [cz?+(l1—c)x+1] 
= [4cfa* + 4,2 — 4cf@ + 1 — 2c, + cf] 
lene +9e4° — 222° + Ie? +27 — 2e,07? + ex + 2% — 2c," + 1] _ 
[2c,2° + Qa? — 2c,a? + 22 
= =-cCr*+ (2ci — 4c;) x 4 (3c} + 2c4 3) x? 4 ( 4c? + 6c, — 4) L+ Ce; = 2c1) 


On substituting this into the functional integral and integrating we get: 


1 
i Se 5 2c? — 4c, ee 3c? + 2c, — 3 aia —4c? + 6c, — 4 4 (2 2e1) a 
5 4 3 2 6 
2 2 2 2 
Ct 2cy—4c,\ | (sey +2c, —3 —4cy + 6c; — 4 2 
. | + ( 7 y+ 3 ‘i 5 alee) 0 
3 2 
= ant + 301 F 3 
On differentiating I with respect to c; and setting the result to zero we get: 
dl 6 2 10 
eee Tee a and hence ea 


So, the first approximation is (see Eq. 137): 


10 19 
Y1 et grtl 
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On plotting this solution (plotted as dashed curve in Figure 70) alongside the analytical solution 
(plotted as solid curve in Figure 70) we see that it is close but distinguishable. So, we go to the next 


approximation. 
T T T 
— Analytical eps 
2 ioea Ne GQSa SSBC 
o Yo 


0.8 1 
X 
Figure 70: Plot of the analytical solution y = cos x + (35898 9) sina — x of Problem 3 of § 8 (as obtained 
in part g of Problem 12 of § 1.4) alongside the first Rayleigh-Ritz approximation y; = — Pe? + er +1 
and the second Rayleigh-Ritz approximation y2 = a? Sen x? 4 Sr +1. 


Now, the second approximation is: 


yo =lta+ea (2 —1) 4+ cox? (2 —1) = cox? + (cy — cg) 2? +(1—c1) 241 (138) 
On substituting this into the integrand of the functional integral we get: 


2 


y?—-y?-20y ~ [3cga” +2(cy-—co)a+(1 c)]° [cox® + (ey — cg) 227+ (l—cq) a+ 1] 
2a [coa® + (ce) — cg) 27+ (1-1) e+] 
= —ca® + Qe _ 2cc2) x + (8c3 — or + 4c, c9 — 4c) ee 
(2c; + 10¢,c2 — 123 — 4c, + 2c2) ee + (4c5 — 14¢,c9 + 2c, + 8c2 — 34 37) x? 4 
(4cic2 4c? + 6c, — 4c2 4) g+ (ef _ 2c1) 


On substituting this into the functional integral and integrating we get: 
ae [- gar. (Mazae) 4 (od toate) 
7 6 5 


(= + 10c,¢2 — 12c2 — 4c, + “22 pie (4 — 14cycg + 2c, + 8c2 —3 + *) go 
4 3 
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1 


4 Ac? 4 . 4 4 
_ a eee ) a” + (cf — 21) | 
0 
a! | (= — 2c1c2 ; 8c3 — Gs + 4c1C9 — 4c2 . 
7 6 5 : 
2c? + 10c1¢2 1265 Ac, + 2c Ac? — 14c,c2 + 2c) + 8c2 — 3+ 3c? % 
4 : 3 
4 4c? . 4 4 
( sat ato )+e 20) 0 
_ 3 _ 18 oe 7 ee 3 
7 in” iG 10522 8 80 
On differentiating J with respect to c, and ce and setting the results to zero we get: 
Ol 38 # 30 2 0 
OG ee ee 
Or 38 i 26 = ll 0 
04 10° 105° 30 
On solving this system of simultaneous equations we get c, = —34 and cg = -#. So, the second 


approximation is (see Eq. 138): 


14 5 347 14\ 5 347 ‘ ; 
= ! ! 1 b1= 1 
ame sh ( 369 a) es ( - aa = Ae 36g" ~ 369" 


On plotting this solution (plotted as circles in Figure 70) alongside the analytical solution (plotted as 
solid curve in Figure 70) we see that it is indistinguishable from the analytical solution. So, we stop 
this gradual process. 

4, Re-solve part (c) of Problem 12 of § 1.4 using this time the Rayleigh-Ritz method in a gradual process. 
Plot the obtained approximation in the end of each stage of approximation (starting from y,) and 
hence stop this gradual process when the obtained solution is practically indistinguishable (by vision) 
from the analytical solution that you obtained in Problem 12 of § 1.4. 


Answer: We have I[y] = diets ve dx with y (x, = 2) = Land y (a2 = 4) = 31. We again use polynomial 
basis functions. Referring to point (d) in the text, we select the zeroth basis function ¢o to satisfy the 
given boundary conditions and select all the other basis functions ¢; (¢ = 1,--- ,n) to vanish at the 
boundaries. So, if we choose ¢9 = 15a—29 (which is the straight line passing through the two boundary 
points) then the boundary conditions are satisfied by this function because ¢o(2) = 30 — 29 = 1 and 
go(4) = 60 — 29 = 31. Also, if we choose the other basis functions so that they all contain the factor 
(a — 2) (a — 4) then they will all vanish at the boundaries because the (x — 2) factor will ensure the 
vanishing at « = 2 while the (a — 4) factor will ensure the vanishing at « = 4. Accordingly, we can 
write the n*” approximation y,, as: 


Yn = 15a —29+ (a — 2) (a — 4) [cr | Cot +++ + en2"~"] 
— 15a — 29 + (a? 6x 4 8) [er + cor+---4 ou | 


Now, the first approximation is: 
y1 = 15x — 29+ cy (x? — 6x 4+ 8) (139) 
On substituting this into the integrand of the functional integral we get: 


y? [15 + c (2x — 6)]” 
a3 
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[2c1@ — 6c, + 15]° 
a 
Act x? + (60c, — 24ct) x + 36c? — 180c, + 225 
a3 


= Acix~* + (60c, — 24ci) a7? + (36c? — 180c, + 225) x? 


On substituting this into the functional integral and integrating we get: 


— 
2 


36c? — 180c, + 2) ot] . 
2 2 


Act In x + (24c} — 60¢;) 2~* ( 


2 ol 22 
2 Act In4 + (24c? — 60c1) 47 (= oe z *) | 
a 22 
act In2 4 (24ci a 60c1) 9-1 (= a + *) | 


128In2—84\ , 15  — 675 
~ 32 Cig 


On differentiating I with respect to c, and setting the result to zero we get: 


I 128In2 — 84 1 
ah 2( Rae? Ja er and hence c, © 6.352111366 


dc 32 
So, the first approximation is (see Eq. 139): 
y. = 15a — 29 + 6.352111366 (2? — 62 + 8) 


On plotting this solution (plotted as dashed curve in Figure 71) alongside the analytical solution 
(plotted as solid curve in Figure 71) we see that it is close but distinguishable. So, we go to the next 
approximation. 

Now, the second approximation is: 


yo = 15a — 29 + (x? — 6x + 8)(c, + C22) (140) 


On substituting this into the integrand of the functional integral we get: 


ei [15 + (2a — 6) (c1 + cox) + cy (a? — 6x + 8) 

fae 

= 9e5x + (12cic2 — 72c}) + (Acj — 84c1c2 + 192c5 + 90c2) > + 
(176cyc2 — 24c} — 192c3 + 60c, — 360c2) 7? + 
(36c} — 96cic2 + 64c} — 180c, + 240c2 + 225) x? 


On substituting this into the functional integral and integrating we get: 


9 
[Te fae + (12c1c2 — 72c3) D+ (4ci — 84c;¢9 + 192¢3 + 90c2) Ina — 


(176cic2 — 24ct — 192c5 + 60c, — 360c2) a~* — 


(36c} — 96c1c2 + 64c3 — 180c; + 240c2 + 225) x~? 


2 


L 
5 
[ (128 In 2 — 84) c? + (1888 — 2688 In 2) crc + (6144 In 2 — 4224) c2 — 60e) + 


a 
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35 1 1 
— Analytical 
sees Y, 
30 i fe) Y5 | 


or | l | 


2 2.5 3 3.5 4 


Figure 71: Plot of the analytical solution y = = —1 of Problem 4 of § 8 (as obtained in part c of Problem 


12 of § 1.4) alongside the first Rayleigh-Ritz approximation y; = 152 — 29 + 6.352111366(x? — 6x + 8) and 
the second Rayleigh-Ritz approximation y2 = 152 — 29 + (a? — 6x + 8)(2.526375488 + 1.455927429z). 


(2880 In 2 — 2160) co + 675 


On differentiating I with respect to c, and c. and setting the results to zero we get: 


ol 
aa (128 In 2 — 84) cy + (1888 — 2688 In 2) co — 60 = 0 
1 
ol 
sa (1888 — 2688 In 2) c; + 2 (6144 In 2 — 4224) cy + (2880 In 2 — 2160) = 0 
C2 


On solving this system of simultaneous equations we get cy ~ 2.526375488 and co ~ 1.455927429. So, 
the second approximation is (see Eq. 140): 


Yo = 15a — 29 + (x? — 6x + 8)(2.526375488 + 1.455927429.2) 


On plotting this solution (plotted as circles in Figure 71) alongside the analytical solution (plotted as 
solid curve in Figure 71) we see that it is indistinguishable from the analytical solution. So, we stop 
this gradual process. 

5. Re-solve part (e) of Problem 12 of § 1.4 using the ys Rayleigh-Ritz approximation. Plot the obtained 
Ys approximation alongside the analytical solution that you obtained in Problem 12 of § 1.4. 
Answer: Following a similar method to that used in the previous Problems, we have: 


2 uy 2 3 4 
Y = “2+a(2-2) [er +ere+ coe + cy? + 52°] 


= c50° + (c1 =¢s J x 4 (cs *ca) a op 
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2 
(ce es) ger (1 co) Pom E = a1] x (141) 


On substituting this into the functional integral and integrating we get: 
n/2 
T= i (y’? —y*? +ycoshx) dx 
0 


n/2 
A dx bese” + 5 (c1 ~c5) gt+4 (cs ~ca) 2 
0 2 2 


l2 


[esa t (c1 cs ) x 4 (cs es) x 4 


2 
(c2 =e x 4 (a 5e2) x 4 (2 Fa] q cosh x 


~ 0.973158c} + 1.528633c1c2 + 1.463175¢1¢3 + 1.561649¢1¢4 + 1.78517 1e1¢e5 + 
1.050358c¢5 + 2.563096cgc3 + 3.133517 cac4 + 3.904376c2c5 + 1.791482¢3 + 
4.786866c3Cc4 + 6.350638c3c5 + 3.406354cj + 9.47767 1c4cs + 6.842153c2 
—0.263643c, — 0.177926c2 — 0.157084c3 — 0.161129c4 — 0.181082c5 + 1.453547 


On differentiating I with respect to c1,---+ ,cs and setting the results to zero we get: 
Ol 
Aa 1.946316c¢, + 1.528633c2 + 1.463175c3 + 1.561649c,4 + 1.78517 1¢c5 — 0.263643 = 0 
1 
I 
ar = 1.528633c, + 2.100716c2 + 2.563096c3 + 3.133517c4 + 3.904376c5 — 0.177926 = 0 
2 
ol 
ae 1.463175c1 + 2.563096c2 + 3.582964c3 + 4.786866c,4 + 6.350638c5 — 0.157084 = 0 
C3 
ol 
a 1.561649c¢, + 3.133517c2 + 4.786866c3 + 6.812708c4 + 9.47767 1c5 — 0.161129 = 0 
C4 
ol 
aa 1.785171c, + 3.904376c2 + 6.350638c3 + 9.477671c4 + 13.684306c5 — 0.181082 = 0 
5 


On solving this system of simultaneous equations we get c, ~ 0.1680159606, co ~ —0.05218684043, 
c3 & 0.006210311128, c, ~ 0.004267046318 and cs; ~ 0.0003668967898. So, the y; Rayleigh-Ritz 
approximation is (see Eq. 141): 


Ys ~ 0.0003668967898x° + 0.003690726188269a° — 0.000492349554578a* 
—0.0619419743381 16a + 0.24999085785447527 + 0.372700918614185x 


On plotting this solution (plotted as circles in Figure 72) alongside the analytical solution (plotted as 
solid curve in Figure 72) we see that the ys Rayleigh-Ritz approximation is virtually identical to the 
analytical solution. 
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1 T T T T T T 
— Analytical 
ie) Ye 


1.6 


Figure 72: Plot of the analytical solution y = —+ cos x + (1 - + cosh z) sing + ; cosh x of Problem 5 of § 
8 (as obtained in part e of Problem 12 of § 1.4) alongside the y; Rayleigh-Ritz approximation. 


6. Re-solve Problem 6 of § 1.6 using this time the Rayleigh-Ritz method. Plot the obtained approximation 
of the Rayleigh-Ritz method and compare it to the analytical solution that you obtained in Problem 
6 of § 1.6. 

Answer: This is a 2D problem and hence in the following we will extend (rather briefly) the above- 
described 1D Rayleigh-Ritz method to 2D. We have I [z] = i ie (22 — 22) dady with z(0,y) = 
z(#,0) = z(l,y) = z(a,1) = 0 and z(0.5,0.5) = 1. We again use polynomial basis functions. 
Referring to point (d) in the text (and noting the extension to 2D), we select the zeroth basis function 
go to satisfy the given boundary conditions and select all the other basis functions ¢;; to vanish at 
the boundaries. So, if we choose ¢9 = 0 (which is the plane z = 0 that passes through the four 
boundary lines) then the above boundary conditions are obviously satisfied by this function. Also, 
if we choose the other basis functions so that they all contain the factor xy(a — 1)(y — 1) then they 
will all vanish at the boundaries because the factors x, y, (a — 1), (y — 1) will ensure the vanishing at 
(0, y),(v,0),(1,y), (2,1) respectively. Accordingly, we can write the mn‘” approximation zm as: 
Zmn = O+ ey(e—1)(y—1) [er + core + crey + coory + 3107 + c13y? +-°° + ore aa aes 

(a*y? — a*y — zy* + vy) [er + core + croy + Coovy + c3127 + cigy® + +++ + Cmne™ *y” "| 
So, let try the zg approximation, that is: 


22 = (ey = xy _ ry” at ry) [c11 + cat + croey+ C222Y| 
fa?y? + (b— f) ay? + (e— f) x?y? — bay 
ery? + (a —b—c4t f) x*y? + (b— a) 27y + (c— a) zy” + ary (142) 


where for the sake of simplicity and clarity we use in the second line (and subsequently) a,b,c, f to 
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represent C11, C21, C12, €22.!!4°l On substituting this into the functional integral and integrating twice 
we get: 


1 1 
pe [ [ @-2)aeay 
0 0 


1 1 
| ay [ dx [3f07y? +3 (b— f) ay? + 2(ce— f)ay® — 3b2?y — cy? +2(a—b—c+ f) ay’ 
0 0 


l2 


-2(b—a)ey +(c—a)y? tay] — [3f2%y? +2(b— f)a°y +3 (e— f)ary? 


2 
ba? — 3cay” + 2(a—b—ct+ f)2’y+ (b—a)2?+2(c a) ey + az] 


I 


no _ dy [ {28/? + 70cf + 70c?} y6 

{—56f? + (56b — 140c + 70a) f — 140c” + (70b + 140a) c} y? 

{10f? + (7c — 112b — 140a) f + 7c” — (140 + 280a) c + 28b? + 70ab + 70a7} y* 
+ {24f? + (84¢ + 32b + 28a) f + 84c? + (28) + 56a) c — 56” — 140ab — 140a7} y? 
{—8f? + (28b — 28¢ + 49a) f — 28c? + (49b + 98a) c + 20b” + 42ab + 4207} y? 
{— (8b + 14a) f — (14b + 28a) c + 8b? + 28ab + 2807} y — (2b? + Tab + 7a} 


2 (b? — c? + bf —cf) 
1575 


As we see, I is independent of a and hence a (which stands for c,;) is an arbitrary constant that can 
be set to zero, i.e. a = 0.!!44] On differentiating J with respect to b,c, f and setting the results to zero 
we get: 


al 
Ob 
al 
Oc 
OL 
of 


On solving this system of simultaneous equations we get b = c = -f So, the z22 approximation is 
(see Eq. 142 noting that a = 0): 


I 


2b+ f =0 
= -—2c—f=0 


= b-—c=0 


3 3 1 1 1 1 
3,3 3,2 2.3 3 3 272 2 2 
gy? — <x y? — =x “¢ a >) ages = 
222 r( y ans gv Y ge gee vy gv Y 5) 


To determine f we use the constraint z (0.5,0.5) = 1, that is: 
222 a 


f= ay — Sa3y? — 37248 + Sa3y + dary? + 2x?2y? — $a2y — dary? ~ 0.015625 


64 


IR 


Hence: 


3 3 1 1 1 1 
Zo9 = —64 (20° se sty + sty + ty + 2e7y7 — xt y = se") 


[140] We use f instead of d or e to avoid potential confusion with dx and the number e ~ 2.71828. 

[141] Ty fact, this should have been anticipated earlier from the fact that ¢9 = 0 and all the boundary conditions are zero 
and hence ci; (which represents the constant term) should be zero. However, we preferred to do it the long way to 
demonstrate this fact. 
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On plotting this solution (see the upper frame of Figure 73) alongside the analytical solution (see 
the lower frame of Figure 73) which we obtained in Problem 6 of § 1.6 we see that although the z22 
approximation is not sufficiently accurate, it is still useful. In fact, we may expect a better result from 
higher order approximations although the algebra becomes increasingly messy and difficult. However, 
the algebraic difficulties can be overcome with automation where computer codes can take care of the 
required hard work. 

7. Re-solve Problem 7 of § 1.6 using this time the Rayleigh-Ritz method. Plot the obtained approximation 

of the Rayleigh-Ritz method and compare it to the analytical solution that you obtained in Problem 
7 of § 1.6. 
Answer: We have I [z] = if (i (22 + 22) dx dy with z(0,y) = z(,0) = z(1,y) = 0 and z(a,1) = 
sin (72) sinh (7). We again use polynomial basis functions (apart from the zeroth basis function which 
should satisfy the boundary conditions and hence it can take any necessary form, as will be clarified 
next). Referring to point (d) in the text (and noting the extension to 2D), we select the zeroth basis 
function ¢9 to satisfy the given boundary conditions and select all the other basis functions ¢;; to 
vanish at the boundaries. So, if we choose ¢o = sin (7x) sinh (ay) then the above boundary conditions 
are obviously satisfied by this function. Also, if we choose the other basis functions so that they all 
contain the factor xy(a — 1)(y — 1) then they will all vanish at the boundaries because the factors 
x,y, (% — 1),(y — 1) will ensure the vanishing at (0,y),(x,0),(1,y),(#,1) respectively. Accordingly, 
we can write the mn approximation Zp, as: 


Zmn = sin (max) sinh (zy) + xy(x — 1)(y — 1) fen + cat + Cray + co2%y 4 
cg1@” + ci3y? +++ + ee a 


= sin(7a) sinh (ry) + (a7? Say ay? + ry) feu + cat + Cray + Corry + 
m—-1,n—-1 


C310" + C13Y" ++ + Cmnt Yy 


So, let try the z22 approximation, that is: 


zg = sin (ma) sinh (wy) + (#7y* — ay — ay? + xy) [er + coe + Crey + eo2ry| 
= sin (r2)sinh (ry) + fa°y? + (b— f) ay? + (c— f) 2? y? — baty 
cry? + (a—b—c+ f)a?y? + (b— a) 2’y + (c— a) ay? + axy (143) 


where for the sake of simplicity and clarity we use in the second line (and subsequently) a, b,c, f to 
represent C11, C21, C12, C22. On substituting this into the functional integral and integrating twice we 


get: 
1 pl 
i: i (ze + 25) dx dy 


1 1 
i ay [ dx [7 cos (rx) sinh (ry) + 3f2?y? + 3(b— f) ay? + 2(ce— f)zy® 
0 0 


— 
II 


l2 


2 
3ba7y — cy® +2(a—b—c4 f) ay? +2(b—a)ry+(c a)y? + ay] + 


lr sin (wx) cosh (ry) + 3fa°y? +2(b— f)x®y+3(e—- f)2?y? 


2 
ba? — 3cay” +2(a-—b—c+ f)2°y+(b—a)2?+2(c a) ey + ac] 


1 
= 79600 [s2/? + 4{24(c+b) + 35a} f + 96c? + 140 (b+ 2a) c + 96? + 


35 {8ab + 8a? + 450 (e?" — e-*")} | 
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0 0 
y x 


Figure 73: Plot of the z22 approximation of the Rayleigh-Ritz method of Problem 6 of § 8 (upper frame) 
alongside the analytical solution z = sin(aa)sin (ay) of Problem 6 of § 1.6 (lower frame). For fair 
comparison, the same xy mesh is used in both plots. 


8 RAYLEIGH-RITZ METHOD 210 


On differentiating I with respect to a,b,c, f and setting the results to zero we get: 


T 

a = 140f + 280c + 280b + 560a = 0 
a 
I 

a = 96f + 140c + 192b + 280a = 0 

ol 

a 96f + 192c + 140b + 280a = 0 
c 

Ol 

Of = 64f + 96c+ 966+ 140a = 0 


On solving this system of simultaneous equations we get a = b =c = f =O. The reason is that our 
“approximate” function (i.e. z22) contains the analytical solution (as represented by ¢o) and hence all 
the other terms should be zero. So, our z22 “approximation” is the exact solution that we obtained in 
Problem 7 of § 1.6, i.e. z = sin (7a) sinh (zy). Hence, the plot of the z22 “approximation” is the same 
as the plot of the analytical solution that we obtained in Problem 7 of § 1.6 (see Figure 2). 

8. Re-solve Problem 8 of § 1.6 using this time the Rayleigh-Ritz method. Plot the obtained approximation 

of the Rayleigh-Ritz method and compare it to the analytical solution that you obtained in Problem 
8 of § 1.6. 
Answer: We have I {z] = fo ih (2 Repeal 2s) dx dy with z(0,y) = z(x,0) = z(1,y) =0 
and z(#,1) = sin (az). We again use polynomial basis functions (apart from the zeroth basis function 
which is determined by the boundary conditions and hence it can take any necessary form, as will be 
clarified next). Referring to point (d) in the text (and noting the extension to 2D), we select the zeroth 
basis function ¢o to satisfy the given boundary conditions and select all the other basis functions ¢;; 
to vanish at the boundaries. So, if we choose ¢9 = ysin(ra) then the above boundary conditions 
are obviously satisfied by this function. Also, if we choose the other basis functions so that they all 
contain the factor xy(a — 1)(y — 1) then they will all vanish at the boundaries because the factors 
x,y,(% — 1),(y — 1) will ensure the vanishing at (0,y),(x,0),(1,y),(#,1) respectively. Accordingly, 
we can write the mn!” approximation zn as: 


Zmn = ysin(rx) + cy(a —1)(y— 1) fen + Coit + Croy + Co2ry 4 
C3107 + cy3y* +++ + eae a 
= ysin (12) + (a7y? — xy _ ry” + ry) fen + Coit + C12Y + C22KYy + 


m—-1,n-1 


3107 + ci3y” + +++ + €mn& y 
So, let try the zg2 approximation, that is: 


22 = ysin(mx) + (a*y? — ay — ay? + ry) [cry t+ cae + cray + ca2ry] 
+ fay’ + (b— f)a®y? + (c— f) a? y? — br*y 
cay? +(a—b—ct f)x?y? + (b—a)az?y + (c— a) xy 


Nee; 


= ysin(7x 
2 


+ axy (144) 


where for the sake of simplicity and clarity we use in the second line (and subsequently) a, b,c, f to 
represent C11, C21, C12, C22. On substituting this into the functional integral and integrating twice we 


get: 
1 pl 2 
2 
i: | (242-02? +) dx dy 
0 Jo y 


1 1 
| ay [ dx [ry 00s (nx) + 3fax7y? + 3(b— f) 2?y? + 2(c— f) xy® — 3ba7y 
0 0 


— 
ll 


l2 
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cy® +2(a—b—c+ f) ay? +2(b-—a)ry+(c—a)y’ 4 ay) 


[sin (wx) + 3fa%y? +2(b— f)a°y+3(c— f)2?y? — br? — 3cry? 


+2(a—b—c+ f)x?y+(b a)a? +2(c~a)ay tae] + 


(Zr?) [ysin (ne) + Foy? + 6— A) a8y? + (em f)aty? — bo%y 


2 
cay? + (a—b— c+ fay? + (ba) a?y + (Ca) xy? + ay) 


= x = | (8° — 2800) f? + (280° — 8680") of + (28n° — 952n*) bf 
+ (49n° — 1470n*) af — 117600f + (28m° — 868n°) c? + (49° — 14707°*) be 
+ (98m° — 29401) ac — 235200c + (287° — 12327) b? + (98° — 39207) ab 


—352800b + (987° — 39207°) a* — 705600a — 1323002] 


On differentiating I with respect to a,b,c, f and setting the results to zero we get: 


a = 2(98n° — 3920r*) a + (98n° — 39207*) b + (98n° — 29407°) c+ 
(49n° — 1470n°) f — 705600 = 0 

a = (98m° — 3920n°) a + 2 (28° — 12320) b + (49° — 1470n*) c+ 
(28m° — 952m°) f — 352800 = 0 

a = (98° — 2940) a + (49n° — 14707) b + 2 (287° — 8687") c+ 
(28m° — 8687°) f — 235200 = 0 

a = (49n° — 14707*) a + (28m? — 952n°) b+ (282° — 8682°) c+ 


2 (8m — 2807*) f — 117600 = 0 
On solving this system of simultaneous equations we get: 


120007? — 388800 168000 — 168007? 
a= 3.863893488 b= f=0 C= 7 _ 14805 4+ 362009 ~ 0.031307421 


am? — 14875 + 362073 


So, the z22 approximation is (see Eq. 144): 


zoo «~ -ysin (rx) + 0.03130742127y° — 0.031307421 ay? — 3.89520090927y + 
3.86389348822y + 3.895200909ry? — 3.863893488.2y 


On plotting this solution (see the upper frame of Figure 74) alongside the analytical solution (see the 
lower frame of Figure 74) which we obtained in Problem 8 of § 1.6 we see that the z22 approximation is 
almost identical to the analytical solution. In fact, we should expect a better result from higher order 
approximations (e.g. 233) although the algebra becomes increasingly messy and difficult. 

9. Re-solve Problem 10 of § 1.6 using this time the Rayleigh-Ritz method. Plot the obtained approxima- 
tion of the Rayleigh-Ritz method and compare it to the analytical (series) solution that you obtained 
in Problem 10 of § 1.6. 
Answer: We have I [z] = i ie (22 + 22 — 42) dxdy with z(0,y) = z(a,0) = z(1,y) = z(#,1) = 0. 
In fact, the domain and boundary conditions for this Problem are the same as those of Problem 6 and 
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Figure 74: Plot of the z22 approximation of the Rayleigh-Ritz method of Problem 8 of § 8 (upper frame) 
alongside the analytical solution z = y? sin (rx) of Problem 8 of § 1.6 (lower frame). For fair comparison, 


the same xy mesh is used in both plots. 
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hence the basic formulation (as represented by Zn) is the same. So, let try the z33 approximation, 
that is: 


2,2 2 2 2 2 
233 = (ay? —a?y— ay” +. 2zy) |e +ciey + cisy? + core + Coozy + co32y 


2 2 29 
€31XL" + C32N7Yy + C33X7Y 


= katyt+(h—k)a?y* + (j —k) ety? + (c—h)a?y* + (i-j)aty? t+ (g-—h—-—jtk) yt 
(b—c—gt+h)a7y? — cry + (f —g—itj) ey? — ixtyt+ (a—b—ft+g)2?y? + 
(c—b) ay? + (i— f)a’y + (f —@)27y + (b- a) ay? + azy (145) 


where for the sake of simplicity we use in the second equality (and subsequently) a,b,c, f,g,h,i, j,k 


to represent C11, C12, C13, C21, €22, €23, €31, €32, €33- On substituting this into the functional integral and 
integrating twice we get: 


1 pl 
i | (22+ Z, —4z) dxdy 
o Jo 


1 1 
i ay | dx [4ha3y4 +3 (h— k)a?yt +4 (j — hk) ay? +2(c— h) ay! +4 (i — jf) 0%y? 
0 0 
3 ( 


g—-h—-jtk)2’*y?+2(b—c—gth)ay — cyt 4+3(f—g-—it j)2’y? — 4ix?y 


— 
II 


l2 


2(a—b—f+g)ay? +(c—b) yi +3(- f)z’y+2(f-—a)zy+(b a)y tay) + 


[4katy? +4(h—k)23y? +3(j —k) aty? +4(c— kh) a?y? +2(6—j) uty 


+3(g—h—jtk)o%y?+3(b—c—g+h)a7y? — 4ery? +2(f —g—i+ 9) xy — iat 
2 
2(a—b—ft+g)2°y+3(c—d)ay’? + (i- f) 2? +(f —a) 2? +2(b-a)ay4 ax| 


4) katy! + (hk) ay" + (5 — boty? + (c—h)a?y* + (i jaty? 


(g-h—-j+k)a®y? +(b—c—gt+h)a’y> — cay* + (f —g -i+j)x°y? — iaty 
(a—b— f-+g)a°y? + (c-b) ay? + (i— f)a%y + (f —a)a°y + (b-@) ay? + avy] 


1 
= 564600 [58800 + 5880ab + 3444ac + 5880af + 2940ag + 1722ah + 3444a2 + 172207 + 


1008ak + 2016b? + 2814be + 2940bf + 2016bg + 1407bh + 1722bi + 1176bj + 819k + 
1106c? + 1722cf + 1407cg + 1106ch + 1008ci + 819cj + 642ck + 2016 f? + 2016 fg + 
1176 fh +2814 fi + 1407 fj + 819fk + 672g? + 924gh + 1407gi + 92497 + 630gk 4 
356h2 + 819hi + 630hj + 480hk + 1106i7 + 1106ij + 642ik + 3567? + 480k + 

180k? — 29400a — 14700b — 8820c — 14700 — 7350g — 4410h — 8820i — 4410 — 2646k 


On differentiating I with respect to a,b,c, f,g,h,71,7,& and setting the results to zero we get: 


a = 11760a + 58806 + 3444c + 5880f + 29409 + 1722h + 34447 + 17227 + 1008k — 29400 = 0 
a = 5880a + 4032b + 2814c + 2940 f + 20169 + 1407h + 17222 + 11767 + 819k — 14700 = 0 
a = 3444a + 2814b + 2212c + 1722 f + 14079 + 1106h + 10082 + 8197 + 642k — 8820 = 0 

as = 5880a + 2940b + 1722c + 4032 f + 20169 + 1176h + 28142 + 14077 + 819k — 14700 = 0 


of 
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io = 2940a + 2016b + 1407c + 2016 f + 13449 + 924h + 14077 + 9247 + 630k — 7350 = 0 
a = 17224 + 1407b + 1106c + 1176f + 924g + 712h + 819% + 6307 + 480k — 4410 = 0 

a = 3444a + 1722b + 1008c + 2814f + 14079 + 819h + 22127 + 11067 + 642k — 8820 = 0 
7 = 1722a4+1176b + 819c + 1407f + 924g + 630h + 11067 + 7127 + 480k — 4410 = 0 
ar = 1008a + 819b + 642c + 819f + 630g + 480h + 6422 + 4807 + 360k — 2646 = 0 


On solving this system of simultaneous equations we get a = 4, b = —21/4, c = 21/4, f = —21/4, 
g = 63/4, h = —63/4, i = 21/4, 7 = —63/4 and k = 63/4. So, the z33 approximation is (see Eq. 145): 


63 63 63 21 
%33 = | gty* 5 x y4 5 ay? + 21x y* + Qaty? + 6823 y? — 4207y3 — 42234? — zy" 


21 121 21 
eg ty 4 gy? 4 


375 
7 mi 5 xy” + dary 


4 


3, 2l 
aya EY 


On plotting this solution (see the upper frame of Figure 75) alongside the analytical (series) solution 
(see the lower frame of Figure 75 where we used in this plot all the series terms up to and including 
m =n = 7) which we obtained in Problem 10 of § 1.6 we see that the z33 Rayleigh-Ritz approximation 
is almost identical to the analytical (series) solution. 
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Figure 75: Plot of the z33 approximation of the Rayleigh-Ritz method of Problem 9 of § 8 (upper frame) 
[=-1)")G—-[-1]") 

mn(m?2+n?) 

Problem 10 of § 1.6 (lower frame). For fair comparison, the same xy mesh is used in both plots. 


sin (max) sin (nay) of 


alongside the analytical (series) solution z = 4 Se an a 


Chapter 9 
Numerical Methods 


There are many numerical methods for solving variational problems. These methods are generally based 
on discretizing the continuous (analytical) formulation of the calculus of variations problems and imple- 
menting the discretized formulation version manually or computationally (using computers). In this book, 
we just investigate briefly one of the simplest (and possibly the simplest) numerical variational methods 
which is based on the finite difference technique. The idea of this method is very simple although the 
technique is algebraically messy in most cases (especially if it is implemented manually). This method is 
outlined in the following points: 
1. The analytical variational formulation is fundamentally based on a functional integral (as given by 
Eq. 1). Now, the integral is the continuous version of the discretized version of a sum. So, we simply 
convert the functional integral to a sum, that is: 


where we replaced J (for Integral) with S (for Sum) and replaced the infinitesimal dx with the finite 
Ag. 

2. We discretize the x interval (as determined by the boundaries which represent the limits of the integral) 
to n+1 equal divisions by inserting n points evenly between the two boundaries. Accordingly, we have 
Ag = "7 where x and x41 are the # coordinates of the boundary points. So, we have n + 2 
points (i.e. 2 boundary points and n inserted points) and n + 1 divisions. We can therefore rewrite 
the sum of Eq. 146 as: 


i=0 
where yj = 44%. 
3. We can now form a table summarizing our n+ 2 points. This table shows what is known and what is 
unknown (which what we should look for in our solution), that. is: 


Xo X1 x2 oem emer err ee nne Vent La emia 
Yo a ow ee 4 V 7 V 
Fill clot case | Paw Slee ? ? ? y 
YO Y1 Y2 Yn— 1 Yn Yn+ 1 


where the check mark Y means known while the question mark ? means unknown. Accordingly, $ 
in Eq. 147 becomes a function (rather than functional) of n unknowns which are y1,--- , Yn. We can 
therefore rewrite the sum of Eq. 147 as: 


n 
i=0 
where we keep the square brackets as a reminder although S is actually a function of y1,--+ 5 Yn. 
4. Eq. 148 means that S (which represents our “functional integral”) is a function of the n variables 
Y1,°** , Yn and hence to extremize S' we should stationarize it with respect to these n variables simul- 


taneously by taking the partial derivatives of S with respect to these n variables and set them to zero, 
that is: 

Os 

suman £25: 6 
Oy 


216 
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as 


eS eo. 
OYn 
We then solve this system of n simultaneous equations and obtain the unknowns yj,--- , yn. 
. So, we now have n + 2 known points, i.e. 
(wor ¥o)s (itt), Cha, Yopt oe arses (ie Ug 20) 5, ey ie) Any Pee) (149) 


We can use these points as an approximation to the extremizing function either directly (and hence the 
extremizing function is a a polygonal curve that connects these points consecutively by straight line 
segments) or as an input to an interpolation schemes (such as polynomial and spline interpolations). 


We should finally remark that the approximation generally improves by increasing n (although at a 
cost that may not be affordable or desirable at a certain point). We should also note that the above- 
described finite difference method is one dimensional. The method can be easily generalized to multi- 
dimensions (although the required algebra and mathematical manipulation become very lengthy and 
messy). However, instead of going through the description of this simple generalization we will demonstrate 
this generalization by some examples of the finite difference method in 2D (see Problems 6-8). 


Problems 


1. Re-solve part (c) of Problem 12 of § 1.4 using this time the finite difference method with n = 3 


discretization scheme (i.e. by inserting 3 evenly-spaced points between the boundaries and hence 
making 4 divisions). Plot the obtained approximation points beside the analytical solution that you 
obtained in Problem 12 of § 1.4. 

Answer: From the required discretization scheme n = 3 and the boundary conditions yo (ap = 2) = 1 
and y4 (v4 = 4) = 31 we get Az and form the table, that is: 


A Tn+1— LO 341 — © v4 — LO 4—2 1 
Lo = = — 
n+1 3+1 A 4 2 


Xo vy v2 x3 v4 
2);25) 3 | 35) 4 
1 ? ? 2? | 31 
Yo | ¥1_ | Y2 | Y3 | Y4 


Now, we use Eq. 148 with n = 3 and F = y’?/z°, that is: 


S lyr, Yo, ¥3] = 


av} 
= (yir1 — yi)” 
c a Ax 
i=0 4 
at A= yo) . Ye—m)° , (ys— ye)” z (ya — ys)” 
Tene ahr ee Ae PEA 
— ofa)? | =n), vs—m)? | 814s)" 
8 125/8 27 343/8 


(yi — 291 +1), 16 (y3 — 2y1y2 + yi) 
4 125 


9 NUMERICAL METHODS 218 


2 (8 — 2yoys + y2) 16 (961 — 62ys + ¥8) 
OF 343 


4 1118, 1 992 61847 
243 T 996193 — 941 343% 7 7379 


189 , 32 682g 
500"! 1957182 * 337592 — 97 


On taking the partial derivatives of S with respect to y1, yo, y3 and setting the results to zero we get: 


as _ 189 39 _ 
di, 0. Ee 
aS _ 82 1364 de 30 
Bis. DB BA OR 
OS _ 4, 2236, 992 _ 
Oyz a7? 9261-343 
On solving this system of simultaneous equations we get: 
_ 667 3209 6449 
v1 187 ye Ba ae a4 


On plotting the 5 points (i.e. 2 boundaries and 3 inserted) beside the analytical solution (which we 
obtained in Problem 12 of § 1.4) we get Figure 76. 


35 


T 
— Analytical 
° y; 


X 


Figure 76: Plot of the analytical solution y = = —1 of Problem 1 of § 9 (as obtained in part c of Problem 


12 of § 1.4) alongside the points yj, y2, y3 obtained by finite difference (as well as the boundary points). 


2. Re-solve part (a) of Problem 12 of § 1.4 (with &k = 1) using this time the finite difference method 
with n = 3 discretization scheme (i.e. by inserting 3 evenly-spaced points between the boundaries and 
hence making 4 divisions). Plot the obtained approximation points beside the analytical solution that 
you obtained in Problem 12 of § 1.4. 
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Answer: From the required discretization scheme n = 3 and the boundary conditions yo (ap = 0) = 0 
and y4 (#4 = 1) = 1 we get Az and form the table, that is: 


Ag = 2ntia 0 _ Pati — fo _ %4— Lo 1-0 1 


n+1 3+1 4 4 


Xo xy i) x3 TA 
0 | 0.25 | 0.5 | 0.75 | 1 
0 ? ? ? 1 


Yo Y1 Y2 ¥3 Y4 


Now, we use Eq. 148 with n = 3 and F = y” + y?, that is: 


3 
Stn.yoys] = > [(i)? +97] Ae 
i=0 
3 a ‘* 
= (BS) +] os 
; Ax 
1=0 
3 
(Yin — Yi) 2 
a cA 
a Gs + yj, AL 
yi — Yo) 2 (yo-—y1) on, (ys—ye)” . 2 (ya— ys)” , 9 
= re + yp Aa + + yy Aa he + ys Ax + = + y3Aa 
0 Yi Yo ¥3 
Sha 0) eo ean) ea ey 
33 33 33 
Yi — 8yye + 43 — Byoys + 3 — Bys + 4 
On taking the partial derivatives of S with respect to y1, yo, y3 and setting the results to zero we get: 
Os 33 
ae yi — 8y2 = 0 
Oy1 os 2 2 ue 
Os 33 
= —8y1+ 8y3 = 0 
Ayo Yi 5) Y2 Y3 
Os 33 
= —8y2+ 8=0 
dys Y2 5) ¥3 
On solving this system of simultaneous equations we get: 
— 4096 _ 256 _ 13328 
u* 19041 we Be 8 * 19041 


On plotting the 5 points (i.e. 2 boundaries and 3 inserted) beside the analytical solution (which we 
obtained in Problem 12 of § 1.4) we get Figure 77. 
3. Re-solve the previous Problem using this time n = 4 discretization scheme (i.e. by inserting 4 evenly- 
spaced points between the boundaries and hence making 5 divisions). 
Answer: From the required discretization scheme n = 4 and the boundary conditions yo (ap = 0) = 0 
and ys (a5 = 1) = 1 we get Az and form the table, that is: 
Inti—Xo  La41—-%  X5—-X 1-0 _ 1 


Ax = = — = i 
n+1 44+1 5 5 5 


To | V1 vt v3 T4 | U5 
0 | 0.2 | 0.4 | 0.6 | 0.8 1 
0 ? ? ? ? 1 


Yo | Yi Y2 ¥3 Ya | Ys 
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0.67 7 
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xX 
Figure 77: Plot of the analytical solution y = ecru of Problem 2 of § 9 (as obtained in part a of Problem 


12 of § 1.4 with k = 1) alongside the points y1, y2, yz obtained by finite difference (as well as the boundary 
points). 


Now, we use Eq. 148 with n = 4 and F = y” + y?, that is: 


4 
2 
S[yi,¥2,¥3,y4] = > [w +2] Ax 
i=0 
4 
Yi4+1 — Yi 2 
_— A 
5 [(ast) +t] ae 
: Gar =e) 2 
DB Ag tu As 
_ (y= Yo) aypag GPW ang We) prs 
Ax : Ax : Ax 7 
Ax 
0 2 2 
= 5(-0) +2 45(y2—m) + +5 (ys— yo) + 2 
2 2 
+5 (ya — ys)? + 2 +5(1 ya) t 2 
51 
= 5 (vi + ¥2 +93 +92) — 10 (yiy2 + yous + ysya + ya) +5 


On taking the partial derivatives of S with respect to y1, yo, y3, y4 and setting the results to zero we 
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get: 
Os 102 
= 10y2 = 0 
Oyr 5 at #2 
Os 102 
ys => 5 Y2 10y1 10y3 => 0 
Os 102 
ys => 5 Y3 10y2 10y4 => 0 
Os 102 
aan = ae ya — 10y3 — 10 = 0 


On solving this system of simultaneous equations we get: 


390625 796875 — 1235000 1722525 


Yt 9978951 92 9978951 ¥3 * 9978951 4 9978951 


On plotting the 6 points (i.e. 2 boundaries and 4 inserted) beside the analytical solution (which we 
obtained in Problem 12 of § 1.4) we get Figure 78. 


— Analytical 
° y; 


1 1 L i 
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X 


sinh(2) 
sinh(1) 
12 of § 1.4 with k = 1) alongside the points yj, y2,y3, y4 obtained by finite difference (as well as the 
boundary points). 


Figure 78: Plot of the analytical solution y = of Problem 3 of § 9 (as obtained in part a of Problem 


4, Re-solve part (g) of Problem 12 of § 1.4 using this time the finite difference method with n = 4 
discretization scheme (i.e. by inserting 4 evenly-spaced points between the boundaries and hence 
making 5 divisions). Plot the obtained approximation points beside the analytical solution that you 
obtained in Problem 12 of § 1.4. 

Answer: From the required discretization scheme n = 4 and the boundary conditions yo (ap = 0) = 1 


9 NUMERICAL METHODS 222 


and ys (a5 = 1) = 2 we get Az and form the table, that is: 


Now, we use Eq. 


S [yi Yo, Y3, Ya] 


— Unt1—2o  Taz41—-—X%O _ %s—-—X 1-0 1 


n+1 4+1 5 5 5 


To | V1 v2 r3 Ta | U5 
0 | 0.2 | 0.4 | 0.6 | 0.8 1 
1 ? ? ? ra 2 


Yo | Yl Y2 ¥3 Y4 | Y5 


(a -y- 2wiyh| Az 


2 
Gr ) en 


0 
1 
sf (yi yo)” 2xoYyo + 25 (yo m1) yy 2x21 y1 + 25 (ys — y2)” 


ya — 2ax2y2 +25 (ys — ys)” — yz — 2arzy3 + 25 (ys — ys)” — 4 — 2oun| 
1 2 2 2 2 2 
5 [25 (y1 — 1) — 1-0 +25 (ye — m1)” — yi — Fya + 25 (ys — ya) 


4 6 8 
—ys 592 + 25 (ys — y3) — ¥3 — 53 + 25 (2 — ys)’ — ye - Su 


49 
— (y7 + 45 +93 + y3) — 10 (yry2 + yous + ysya) 


5 
1 124 
~ 55 (252y1 + 4y2 + 6y3 + 508y.) + a 
On taking the partial derivatives of S with respect to y1,y2,y3, y4 and setting the results to zero we 
get: 
Os 98 252 
Oy Ber ree) om 
Os 98 4 
yo 5 Y2 Y1 ¥3 5 
Os 98 6 
dys 5 Y3 Ya Y4 5 
Os 98 508 
Oya 5 Y4 ¥3 5 
On solving this system of simultaneous equations we get: 
_ 11255699 _ 13727273 _ 15517472 _ 16488546 
U1 "8267755 2 ~ "8267755 ¥8 * "8267755 U4 = "8267755 


On plotting the 6 points (i.e. 2 boundaries and 4 inserted) beside the analytical solution (which we 
obtained in Problem 12 of § 1.4) we get Figure 79. 
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223 


— Analytical 


0 0.2 


Figure 79: Plot of the analytical solution y = cos x + ( 


0.4 


X 


3—cos 
sin 1 


0.6 0.8 1 


‘\ sina — x of Problem 4 of § 9 (as obtained 


in part g of Problem 12 of § 1.4) alongside the points y1, yo, y3, ys obtained by finite difference (as well as 


the boundary points). 


5. Re-solve part (f) of Problem 12 of § 1.4 using this time the finite difference method with n = 5 
discretization scheme (ie. by inserting 5 evenly-spaced points between the boundaries and hence 
making 6 divisions). Plot the obtained approximation points beside the analytical solution that you 


obtained in Problem 12 of § 1.4. 


Answer: From the required discretization scheme n = 5 and the boundary conditions yo (ap = 0) = 1 
and ye (ae = 7) = 1 we get Az and form the table, that is: 


Ag = omtia= %0 _ ¥541 7% _ Fo _ 7-0 _ 
n+l 5+1 6 6 6 
To | V1 | %2 | %3 | V4 | U5 | VE 
0 7 27 BYid an oT Om 
6 6 6 6 6 6 
1 ? ? ? ? ? 1 
Yo | ¥1 | Y2 | ¥3 | Y4 | ¥5 | Ve 


Now, we use Eq. 148 with n = 5 and F 


5 
2 
Svinte , Ys] = ye [wi + y? = Ay; cosa] Aa 

i=0 

5 ee 
a ys Gren + y? — 4y; cos x; 

i=0 mB 

5 


a Ve — 2s +9? 
Ax 


y’? + y? — 4y cosa, that is: 


Ax 


+ y? Ax — 4y; cos vide 
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2 ss te — Qyisiyi + y? + y? (Az)? — 4y; cosa; (Ax)? 
a6 Ax 
1 5 
2 2 2 2 2 
>= Re » lp? — Wyiriyi + yz + yz (Ax) — 4y; cos x; (Ax) 
5 
6 2 2 nr? 9 nr? 
a 41 — 2Yi41yi + Yj 4 i i 
= » Ea Yitiyi + UF + ZEuE — yi cosa 
6 nr? nr? nr? an? 
= = [yi 2yiyo + y6 4 36/0 5 Yo costo + ys — 2yoyi + y7 4 aa 5 Yi cose 
qr? aq? qr? qn? 
+y3 — 2ysye + ys + 3622 — Fy y2 cos.x2 + yi — 2yay3 + y3 + age — F ys cosas 
2 2 : 2 nm 2 2 3 2 me 
+y5 — 2ysya t+ ya 4 9g U4 — 9 Ys Cos t4 + U6 2y6ys + Ys + 9g U5 — g Ys Costs 
67 5 qo? 2 2 WT? 2 re/3 
= 4 +14 2 Sg 
7 E Y1 36 9 Y2 yoy + Yt + 36 YW 18 Y1 
12 12 1 
+y3 — 2usye + 92 + sev — pgye tui — Quays +43 + seM3 
2 2 2 2./2 
2 2,0 T bo IS 2g: J3 
+ 2 1 — 2y5 4 
U5 — 2ysya + U4 + aeUa + Te st Y5 + oes ig U8 
6 ; nr yy Se ee 
Sale hae (yf +43 +93 + ua +98) — 2 (yiye + yous + ysya + Yas) 
pa Ne 1 ia 9 12/3 ig ‘ia 
Pegs hee gow! ieee ie. ve 


On taking the partial derivatives of S with respect to yj, Yo, Y3, Y4;Y5 and setting the results to zero 


we get: 


as 
Oy 
as 
Oyo, 
as 
Oys, 
as 
dys 
as 
dys 


€ au Qy2 (2 =) =0 
(1+ =) yo — 2y1 — 2ys . =0 

€ =) y3 — 2y2 — 2y4 = 

€ + =) ya — 2y3 — 2y5 4 = =) 
(1+ Fe) mam (2- SE) =e 


On solving this system of simultaneous equations we get: 


yi & 0.96676626 


y2 & 0.72372541 


y3 & 0.40494232 yg ~ 0.19717646 —o-ys & 0.31762333 


On plotting the 7 points (i.e. 2 boundaries and 5 inserted) beside the analytical solution (which we 
obtained in Problem 12 of § 1.4) we get Figure 80. 
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0.8 


0.4 


0.2 


Analytical 


T T 


225 


Figure 80: Plot of the analytical solution y = (=3-) sinha + cosa of Problem 5 of § 9 (as obtained in 


part f of Problem 12 of § 1.4) alongside the points y1, yo, y3, y4, ys obtained by finite difference (as well as 
the boundary points). 


6. Re-solve Problem 7 of § 1.6 using this time the finite difference method with m = n = 3 discretization 
scheme (i.e. by inserting 3 evenly-spaced points between the x and y boundaries and hence making 
16 square divisions). Plot the obtained approximation points beside the analytical solution that you 
obtained in Problem 7 of § 1.6. 
Answer: This is a 2D problem and hence in the following we will extend (rather briefly) the above- 
described 1D finite difference method to 2D. From the required discretization scheme m = n = 3 and 
the boundary conditions z (0, y) = z(x,0) = z(1,y) =0 and z(z,1) = sin (7x) sinh (7) we get Ax and 
Ay and form the table, that is: 


Lm+1— Lo 1-0 1 nm+1— Yo 1-0 1 

ae ee ane Sea 341 4 
rp =0 tia t2=2 «3 = 3 t= 4 
Yo =0 | 200 = 0 210 = 229 = 0 230 = 0 z49 = 0 
y1 = % | 201 =0 zi =? zg1 =? 231 =? z41 = 0 
y2 = % | 202 =0 z12 =? 292 =? 232 =? Z42 = 0 
y3 = 2 | 203 =0 213 =? 293 =? 233 =? 243 = 0 
Ya 4 zoa =O | 214 = in (4) sinh (7) | za4 = sinh (7) | z34 = sin (22) sinh (7) | z44 =0 


Now, we should generalize Eq. 148 to 2D, that is: 


n 


m 


S [211,°°+ ,2mn] = S- SF (is Yss 2g Zidzs iy) Ax Ay 


j=0 i=0 


(150) 
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where 
ZG+1)j — ij Zi(j+1) — ij 
Zia = ae ae and Zij,y = ot (151) 
On using this equation with m = n = 3 and F = z? + zj, we get: 
3 
S[211,°°* , 233] = S- a [Zia _ Zi yl Ax Ay 
j=0 i=0 
=| (zu+ni— 2)" | (ayy — 4)? 
i+1)j — *4j ag aj 
= | Az A 
>|) + a) | are 
j=0 i=0 
7 3 3 Zisayj _ 22441) 5285 + zi Beg44) _ 22i(j41) 2ij + zi Ra Ky 
aoeN ArAaz AyAy 2 


Now, if we have Ax = Ay (as in our case) then this formula will simplify to the following: 


3 3 
2 2 2 
s=))> baer — 22641) 5245 + Gay — 22%G+41)%i5 + 223,| 


j=0 i=0 
that is: 

S = 2zt5 —2z10200 + 2%, — 2201200 + 229 + (i =0,7 = 0) 
255 — 22092190 + Et — 2211210 D240 (j=1,j7 =0) 
oe — 2z39220 + 23, — 2221220 oes (i = 2,7 =0) 
Zag — 2240230 + 231 — 2231230 + 22%) (i = 3,7 =0) 
22, — 2211201 + Zo — 2202201 + 222, (@=0,7 =1) 
wa, — Qeore1a + 229 — Qz10%11 + 227, 4 @=1,j =1) 
con = 2231221 far a ce ae 22992221 en (4 = 2,9 = 1) 
ra — 2241231 + wes — 2730231 + 225, 4 (i =3,j =1) 
ze, — 2212202 + 23 — 2203202 + 2zpo 4 (¢ =0,7 = 2) 
Fes — 2292212 Tr ae = 2213212 22%, (a = 1,j = 2) 
ws = 2.232222 a is Pa —= 2223222 925 (a — 2,9 => 2) 
ae — 2242239 + Be — 2733232 Dees (i = 3,7 = 2) 
ze, — 2213203 + 24 — 2204203 + 2225 (¢=0,7 =3) 
con = 2.223213 a es 2 ae 2214213 Tr O27 rg (4 = 1,j = 3) 
en = 2.233 223 ae Bi =< 2224223 “Te 922, alo (a = 2,9 = 3) 
2s > 2.243 233 ae ay _ 2234233 ar oe, (é = 3,J = 3) 


On combining similar terms and applying the first three boundary conditions li-e. 2(0,y) = z(a,0) = 
z(1,y) = 0] by eliminating all the terms containing zo; or zo or 24;, this expression of S$ will simplify 
to the following: 


S = Az? +423, +423, + 429, +423, + 422, +422, + 428, + 423, 4 


2 2 2 
Ziq + 244 + 234 — 2221211 — 2212211 — 2231221 — 2292221 — 2232231 — 2222212 — 2213212 


—2239 229 — 2293222 — 2233232 — 2293213 — 2214213 — 2233203 — 2%24203 — 2234233 


On taking the partial derivatives of S with respect to 211,---+ , 233 and setting the results to zero (with 
the application of the fourth boundary condition which concerns 214, 224, 234) we get: 
Os 


— 8211 = 2221 > 2212 =0 
Oz11 
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Os 
O2z12 
Os 
0213 
Os 
0201 
Os 
Oz22 
Os 
0203 
Os 
Oz31 
Os 
Oz32 
Os 
0233 


8219 = 2244 > 2299 _ 2213 = 0 


8213 — 2z19 — 2203 — 2sin (5) sinh (7) = 0 
8291 = 2211 ae 2231 he 2292 = 0 
2212 


8299 2291 


2232 2293 =0 


8293 


2299 2213 2233 2 sinh (1) =0 


8231 rae 2291 = 2232 => 0 


82302 fos 2231 > 2229 ore 2233 =0 


8233 — 2239 — 2203 — 2sin (3) sinn (m) =0 


On solving this system of simultaneous equations we get: 


z11 & 0.673903372 


z21 ~ 0.953043288 


231 & 0.673903372 


Z12 & 1.742570199 


Z2 ~ 2.464366408 


232 & 1.742570199 


213 & 3.832011015 


293 ~ 5.419281947 


233 & 3.832011015 


227 


On plotting these points with the boundary points (see the upper frame of Figure 81) alongside the 
analytical solution (see the lower frame of Figure 81) which we obtained in Problem 7 of § 1.6 we see 
that the two plots are almost identical. 

7. Re-solve Problem 8 of § 1.6 using this time the finite difference method with m = n = 4 discretization 
scheme (i.e. by inserting 4 evenly-spaced points between the boundaries and hence making 25 square 
divisions). Plot the obtained approximation points beside the analytical solution that you obtained in 
Problem 8 of § 1.6. 
Answer: From the required discretization scheme m = n = 4 and the boundary conditions z (0, y) = 
z(#,0) = z(1,y) =0 and z(a,1) = sin (rx) we get Ax and Ay and form the table, that is: 


Ages m= ey bd nal yj oo 1-0 1 
m+1 4+1 5 n+1 4+1 5 
Xo =0 t1=+F a2 = 2 «3 =2 t= 4 a =e 
yo =0 | 200 = 0 210 = 220 = 230 = 0 z49 = 0 250 = 0 
Yi Z Zo1 = 0 a=? Z21 =? 231 =? Za. =? 251 =0 
y2= 2 | 22 =0 212 =? 222 =? 232 =? 2z42 =? 252 = 
y3 = 2 | 23 =0 213 =? 293 =? 233 =? 243 =? 253 = 
Y4 = } Za = 0 Z4 =? 294 =? 234 =? 244 =? 254 = 0 
U5 = 2 205 = 0 215 = sin (4) 225 = sin (4) 235 = sin (32) 245 = sin (¥) 255 = 


On generalizing Eq. 148 to 2D (as we did in Problem 6; see Eqs. 150 and 151) and 


m=n=4and F= 72+ 22+ (2y-? — 1?) 2? we get S[z11,-++ , 244], that is: 


Ss 


a. 
a Il Me 


oO 


Me 
= 
& 
Sy 

8 


°. 
oO 


t1)5 — Zig 


& 
ll a Il 
° 
-——— 
CoN 

xX 

a 

= 


Ax 


+ Cae + (2y;° - nm) zi, Aa Ay 


2 
2i(j41) ~ Pig 
) +( Ay 


2 
) + (295° — 0”) 2 


Aa Ay 


noting that 
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0 0 
y x 


Figure 81: The upper frame is a plot of the points 211,--- , 233 which we obtained in Problem 6 of § 9 by 
finite difference (as well as the boundary points) while the lower frame is a plot of the analytical solution 
z = sin (72) sinh (zy) which we obtained in Problem 7 of § 1.6. For fair comparison, we use the same xy 
mesh in both plots. 
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S 


that is: 


Ss 


2 ~~ 2 
Zigtry) ~ 2%G4) 2g F Zi; 


—2 


Zig — 2210200 4 
226 = 2290210 F 
239 — 2230220 4 
249 — 2240230 4 
259 — 2250240 4 
zt, — 22112014 
ee, = 2291211 4 
we = 2231221 q 
ome = 2241231 = 
ey — 22512414 
Ze = 2212202 =I 
es = 2292212 = 
a5 — 2239292 4 
oe a 2242232 4 
ee — 22522424 
273 — 2213203 4 


2 
293 = 2.223213 F 


2 
233 — 2233293 =r 


oe = 2.243.233 q 
ce > 2.253243 = 
2 = 2214204 = 
Ze = 2294214 q 
234 — 2234204 4 


2 
244 — 2244234 4 


AzvAgr 


tf Wei = 2201200 
- ae = 2211210 
+ 23, — 2201200 
+ 23, — 2231230 
+ 23, — 2241240 
2h rae 2202201 
- ee = 2212211 
- oe = 2299221 
- 225 — 2232231 
+ cae — 2242241 
- oe = 2203202 
- cae = 2213212 
- ee = 2.223.222 
- we. = 2.233.232 
L 259 — 2243249 
r aba = 2204203 
- oe — 2214213 
oe — 2294293 
Ee 2, = 2.234233 
ir em = 2244243 
ee = 2205 204 
- ae = 2215214 
me ye a 2295294 


2 
i 235 = 2235 234 


2 
254 — 2254244 4 


2 
t 245 — 2245244 


3 [es = 22415219 + 5 > 


2 


AyAy 


+ 2 — 0.0477) 2G 
+ 2 — 0.0417) 27, 
+ 2 — 0.041") 235 
+ 2 — 0.0477) 23, 
+ 2 — 0.0417) 2%, 
+ 2 — 0.041") 2G) 
+ 2 —0.04n*) zi; 
+ 2 — 0.042") 23, 
+ 2 — 0.042") 23) 
+ 2 — 0.040") zi, 
+ 2 — 0.0417) zy 
+ 2 — 0.0477) z7, 
+ 2 — 0.0417) 235 
+ 2 — 0.041") 235 
+ 2 — 0.0417) zi 
+ 2 — 0.047) 253 
+ 2 — 0.041") 2723 
+ 2 — 0.0417) 23, 
+ 2 — 0.0477) 235 
+ 2 — 0.0417) 235 
+ 2— 0.0417) 264 
+ 2— 0.0477) 224 
+ 2 — 0.04n*) 23,4 
+ 2 — 0.04n*) 2344 


+ 2— 0.047") zi, 


(2y; 


J 


L1) — 224(5-41) 2g + QZ + 0.04 (2y;° — nm) zi 


Ly] 


Ea — 2415245 + Zig-41) — 22541) 29 + (0.08y; 7 +2— 0.0417) 2] 


iI 
I 


il 
i 


>. 
I 
I 


>. 
I 
I 


il 
I 


iI 
I 


iI 
I 


il 
i 


il 
i 


il 
I 


it 
I 


iI 
I 


I 


iI 
i 


>. 
I 
I 


iI 
I 


iI 
I 


>. 
I 
I 


>. 
I 
I 


>. 
I 
I 


iI 
I 


iI 
I 


iI 
i 


iI 
i 


NR NR RR RR Rs a RE EE SE SE 


LN LE PR IR POR IPRS PE PR FS FS PS PS LN NIN PRR: TEENS OAR FEES TENS ES ENS AEN AN 
lI 
~~ PP PF PRP WwWwwWwWwwnwnnnn vd NY RR RRR TO CT OO oO 


iI 


EF wWnNo Fe CO FW YO KF CO KB wd Fe oO FF WN FF CO FW nN Ff CO 
S&B & &. & &. B&B. G&G. BH &. BH. BH BH. O&O. BH HGH. GH. O&O. OG. 


229 


n”) 4 Aa Ay 


On combining similar terms and applying the first three boundary conditions [i.e. z (0, y) = z(x,0) = 
z(1,y) = 0] by eliminating all the terms containing zo; or zio or 25; as well as inserting the numeric 
values, this expression of S' will simplify to the following: 


S = 2291211 — 22120211 + (6 = 0.047) po 
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—2731 221 — 22992221 ac (6 Ea 0.047) Rh 
2741231 a 2232231 iz (6 _ 0.047) reel 
2249241 + (6 _ 0.047) roe 
2299219 —= 2213212 Alc (4.5 a 0.0477) ae 
—2739229 = 2.223.222 ole (4.5 _ 0.0477) oe 
—2242232 o 2233 232 Tr (4.5 == 0.0477) oe 
—2z43242 + (4.5 — 0.0417) zi 
2293213 — 2214213 + (4.222222 — 0.041”) zis 
27233203 — 2224223 + (4.222222 — 0.041”) 235 
2243233 — 2234233 + (4.222222 — 0.0417) 235 
2244243 + (4.222222 — 0.041”) 275 
~Qro4z14 + sin? (=) 2sin (=) za + (4.125 — 0.041) 23, 

2 2 
—2Q2g4z04 + sin? ( =| 2sin ( =| zea + (4.125 — 0.041?) 23, 
—2244234 + sin” (=) 2sin (= ) zga + (4.125 — 0.0477) 23, 
4 4 
+ sin? ( =) 2sin ( =) zag + (4.125 — 0.049”) 22, 
On taking the partial derivatives of S with respect to 211,--- ,244 and setting the results to zero we 


get: 


Qz14 + 2 (4.222222 — 0.041) 213 = 0 


asin (Z) +2 (4.125 — 0.04n*) 24 =0 


2203 + 2 (4.5 — 0.041) zoo = 0 


2204 + 2 (4.222222 — 0.041) 23 = 0 


2 
2sin ( = +2 (4.125 — 0.041?) zo4 = 0 


2233 + 2 (4.5 — 0.0417) z32 =0 


2234 + 2 (4.222222 — 0.041) 233 = 0 


Os 
= 22941 2212 t 2 (6 0.0477) Z11 = 0 
Oz 1 
as : 
= —2211 — 2299 — 2a13 + 2 (4.5 — 0.0417) z12 = 0 
Oz 2 
Os 
a 2 2 
Axis 412 £23 
Os 
a 2 2 
Deis 213 224 
Os 
= 2211 2231 2229 t 2 (6 0.0477) 221 = 0 
Oz21 
ae = —2291 — 2212 — 2239 
Oz29 
on = 2299 2213 2233 
0223 
Os 
a 2 2 2 
Dizon £23 214 234. 5 
Os 
= 2291 2241 2239 + 2 (6 0.0477) 231 = 0 
0231 
Os 
= —2231 — 2229 — 2242 
0239 
Os 
= 2232 2293 2243 
0233 
On = 2233 2224 2244 
0234 


2sin (=) + 2 (4.125 — 0.047?) z34 = 0 
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os 
= 2231 — 224. +2 (6 — 0.0427) z4, =0 
Oz, 1 
os 3 
= —224) — 2239 — 2243 +2 (4.5 — 0.0417) z42 = 0 
Oza2 
os 
5 = —2249 — 2233 — 2244 + 2 (4.222222 — 0.041”) 243 = 0 
243, 
4 
OF oa 2243 — 2244 —2sin( — ) +2 (4.125 — 0.0477) z44 = 0 
Oz44 5 


On solving this system of simultaneous equations we get: 


Z11 & 0.0242152 


z21 & 0.0391810 


231 ~ 0.0391810 


za © 0.0242152 


Z12 ~ 0.0965504 


299 ~ 0.1562218 


232 & 0.1562218 


242 ~ 0.0965504 


213 ~ 0.2159231 


293 & 0.3493709 


233 = 0.3493709 


Z43 & 0.2159231 


~ 0.3805110 


214 


z24 © 0.6156797 


234 ~ 0.6156797 


za ~ 0.3805110 


On plotting these points with the boundary points (see the upper frame of Figure 82) alongside the 


analytical solution (see the lo 
that the two plots are almost 


wer frame of Figure 82) which we obtained in Problem 8 of § 1.6 we see 
identical. 


8. Re-solve Problem 9 of § 1.6 using this time the finite difference method with m = n = 5 discretization 
scheme (i.e. by inserting 5 evenly-spaced points between the boundaries and hence making 36 square 


divisions). 


Answer: From the required discretization scheme m = n = 5 and the boundary conditions z (0, y) = 0, 
z(x,0) =—2?, z(1,y) =y—l and z(x,1) = x— 2? we get Ax and Ay and form the table, that is: 


Aga tmtit#% _1-0_1 daa Ay = Untizyo 1-01 
m+1 5+1 6 n+1 5+1 6 
m=0[ mah | wae | mee | mee | meh [wet 

yo =0 | 20 =0 | a0 =—36 | 20 =——e | BO=—Be | 0 = —H | HO =—B | Zo =I 
y= Z Zo1 = 0 Zi1=? Zo, =? z31 =? Za, =? 251 =? 21 = -3 
yo=§ | 22=0] 22 =? 222 =? 232 =? 242 =? w2=? | te = —F 
y3 = 2 | 23 =0 23 =? 293 =? 233 =? Z43 =? 253 =? 263 = —3 
Y4 = Z zo4 = 0 Z4 =? Zo4 =? Z34 =? Z44 =? 254 =? 264 = 2 
ys = 6 | 205 =0 | 415 =? 295 =! 235 =? 245 =? 25 =? | 265 =—% 
ys=§ | 6=0| ace= 35 | ~e= 35 | ze = 36 | c= 36 | ze= 35 | 266 =0 


On generalizing Eq. 148 to 


m=n=5and F = 22+ 22 — 4z we get $[z11,-°- 


S 


2D (as we did in Problem 6; see Eqs. 
, 255], that is: 


150 and 151) and noting that 


+ an _ 425] Ax Ay 


2 
ZG41 


ee 


*i(jth) — #5 


Ay 


2 
) Azi5 


5 Pite 29 
Ziggy — 2%i(Gt1)%ig + Ai; 


2) +( 


23 2 
‘ae! 22641) 5235 + 2ij 


Aa Ay 


)9 


4zi;| Ax Ay 


AvAz 


AyAy 
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0 0 
y x 


Figure 82: The upper frame is a plot of the points 211,--- , 244 which we obtained in Problem 7 of § 9 by 
finite difference (as well as the boundary points) while the lower frame is a plot of the analytical solution 
z = y’ sin (72) which we obtained in Problem 8 of § 1.6. For fair comparison, we use the same ay mesh 
in both plots. 
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that is: 


2 
~10 = 2210200 ol 


26 = 2220210 
285 = 2230220 
Zo — 2240230 
259 — 2250240 


2 
260 — 2260250 


oe — 221120 
Be, = 229121 
oe — 223122 
2 = 224123 


2 
251 — 225124 


2 
Z61 — 2261 25 


oe = 2212202 
255 — 2299212 
re poe 2.232222 
Les = 2.242232 
2bq — 2252249 


2 
269 — 2262252 


ae = 2213203 
253 — 2203213 
oe = 2.233.223 
Z43 — 2243238 
es ot 2.253243 


2 
263 — 2263253 


ae — 2214 Z04 
2 = 2294 Z14 
wa — 2234204 
eae = 2244 234 


eae — 2254 ZAA 


Be = 2264 254 


2 
215 — 2215205 


2 
295 2295215 


2 
235 = 2235 25 ae 


2 

245 — 2245235 
2 

255 = 2255 245 


2 
+ 202 J 2292201 + 


2 

ayy 221 210 
2 

r £914 — 2z2 220 


2 
- 231 = 223 230 


2 
+ Z41, 7 224 Z40 


2 
- 251 = 225 250 


2 
- £19 = 2212211 


2 
r £99 — 22992221 


2 
- 239 = 2.232231 


2 
+ Z492 = 2242241 


2 
+ 252 a 2252251 


2 
- 203 = 2203202 


2 
+ 213. 2213212 


2 

r 293 — 2223 222 
2 

- 233 = 2.233232 


2 
r £43 — 2243242 


2 
- 253 — 2253252 


2 
+ Z04 boas 2204203 


2 
+ 214 as 2214213 


2 
+ 294 — 2224223 


2 

+ 234 — 2234233 
2 

+ 244 — 2244243 


2 
t 254 — 2254253 


2 
+ 215 — 2215214 


2 
+ 295 2295 Zo4 oe 


2 
+ 245 — 2245 244 


2 
r 295 — 2205204 7 


- wee = 2.235 234 


- es = 2255254 


2 
+ 206 = 2206205 


2 
r 216 — 2216215 


2 
- 236 = 2.236235 


2 
+ 246 — 2246245 


2 
201 = 22 Z00 1 


2 
226 — 2296295 TF 


2z69 — (200/9) 4 
+ 2279 — (z10/9) 4 
+ 2259 — (220/9) 4 
+ 2239 — (230/9) 4 
+ 2z%y — (240/9) 4 
+ 2259 — (250/9) 4 


(i = 0,7 = 0) 
(i= 1,7 = 0) 
(i = 2,7 = 0) 
(i = 3,7 = 0) 
(i = 4,7 = 0) 
(i= 5,7 = 0) 
(i= 0,7 = 1) 
(@=1,7=1) 
(i= 2,7 = 1) 
(¢= 3,7 = 1) 
(i= 4,7 = 1) 
(¢= 5,7 = 1) 
(i= 0,7 = 2) 
(i= 1,7 = 2) 
(i = 2,7 = 2) 
(i = 3,7 = 2) 
(i= 4,7 = 2) 
(i= 5,7 = 2) 
(i = 0,7 = 3) 
(i= 1,7 =3) 
(i = 2,7 = 3) 
(i = 3,7 = 3) 
(i= 4,7 = 3) 
(i= 5,7 = 3) 
(i= 0,7 = 4) 
(@=1,7 =4) 
(i = 2,7 =4) 
(i= 3,7 =4) 
(i= 4,7 = 4) 
(¢=5,7 =4) 
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2 2 2 
265 — 2265255 + 256 — 22%56%55 + 2255 — (z55/9) 


On eliminating the vanishing terms (from the zero boundary conditions) and simplifying we get: 


S 


825 
385 
3% 
3240 


2 
3259 


4zi 42? Tr 427, 427, 427, zi6 
425, 4255 4255 425, 4255 256 
423, 4235 ae 4235 4234 4235 236 
Aziy Aziy Azis Azhs Azis Z46 
423, 4255 4255 4234 4255 256 


2 2 2 2 2 2 
260 + 261 + 262 + 263 + 264 1 265 


—2z20210 — 2211210 


—2230220 — 2221220 


— 2240230 — 2231230 


— 2250240 — 2241240 


— 2260250 — 2251250 


—2291211 — 22120211 


— 2231 221 — 2222221 


—22%41 231 — 2232231 


—2251241 — 2242241 


—22%61251 — 2252251 


—22z99212 — 2213212 


— 2739229 — 2293222 


— 2242232 — 2233232 


—2259242 — 2243242 


— 2262252 — 2253252 


— 2293213 — 2214213 


— 2233223 — 2294223 


— 2243233 — 2234233 


— 2253243 — 2244243 


— 2263253 — 2254253 


—2z942%14 — 2215214 


—2734224 — 2295224 


— 2244234 — 2235234 


—2254244 — 2245244 


—2764254 — 2255254 


—2295215 — 2216215 


— 2235225 — 2226225 


i= 210/9 
= 2020/9 


— (2490/9 


= 231/9 
— 2a1/9 


— 2302/9 


mac zoa/9 
= 234/9 


= 215/9) 
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On taking the partial derivatives of S with respect to 211,--- 


get: 
os 


Oz1 
Os 


Oz12 
Os 
0213 
Os 
Oz14 
Os 
0215 


Os 
Oz21 
Os 
Oz29 
Os 


0203 
Os 
Oz04 
Os 
Oz05 


os 


Oz31 
Os 
0232 
Os 
0233 
Os 
Oz34 
Os 
0235 


Os 
OZ44 
Os 


Oz42 
Os 


0243 
Os 
OZa4 
Os 
OZ45 


= 8211 


—2245235 — 2236235 — (235/9) 
— 2255245 — 2246245 — (245/9) 


— 2265255 — 2256255 — (255/9) 


235 


,255 and setting the results to zero we 


2219 — 2221 — 2212 — (1/9) = 0 

2211 — 2229 — 2213 — (1/9) = 0 

2212 — 2203 — 2214 — (1/9) = 0 

2213 — 2204 — 2215 — (1/9) = 0 

2214 — 2205 — 2216 — (1/9) = 0 

2209 — 2211 — 2231 — 2222 — (1/9) = 0 
2291 — 2212 — 2239 — 2203 — (1/9) = 0 
2209 — 2213 — 2233 — 2204 — (1/9) = 0 
2293 — 2214 — 2234 — 2205 — (1/9) = 0 
2204 — 2215 — 2235 — 2206 — (1/9) = 0 
2239 — 2221 — 2241 — 2232 — (1/9) = 0 
2231 — 2222 — 2z%49 — 2233 — (1/9) = 0 
2239 — 2203 — 2243 — 2234 — (1/9) = 0 
2233 — 2224 — 2244 — 2235 — (1/9) = 0 
2234 — 225 — 2245 — 2236 — (1/9) = 0 
2249 — 2231 — 2251 — 2242 — (1/9) = 0 
2241 — 2232 — 2252 — 2243 — (1/9) = 0 
2242 — 2233 — 2253 — 2244 — (1/9) = 0 
2243 — 2234 — 2254 — 2245 — (1/9) = 0 
2244 — 2235 — 2255 — 2246 — (1/9) = 0 
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Os 

= 8251 2250 224 1 2261 2259 (1/9) = 0 
Oz51 
os 
=e _ 8252 2251 2242 2262 2253 (1/9) =0 
Oz52 
os 
za hlUS 8253 2252 2243 2263 2254 (1/9) = 0 
0253 
os 

= 8254 2253 2244 2264 2255 (1/9) =0 
Oz54 
Os 

= 8255 2254 2245 2265 2256 (1/9) =0 
0255 


On solving this system of simultaneous equations (with inserting the numeric values from the boundary 
points, ie. z19 = —1/36, z29 = —4/36, etc.) we get: 


z1=0 | a1=—- ] 1 =—% | 1 = 3 | 1 =-3 
Z12 = 36 | 222 =0 232 = —qg | 442 =—§ | 2 =—- 
213= 7 | 23=45 233 = 0 z43 = —5 | %53=—7 
“a= 75 | 24=5 234 = 75 z44 =0 254 = — 35 
wi5=5 | t= 5 235 == Z45 = 5 255 = 0 
In fact, these values are identical to the values of the analytical solution (i.e. z = xy — x”) at these 


points and hence we do not need to compare by plotting. 

. Make a brief comparison between the Rayleigh-Ritz method (of chapter 8) and the finite difference 
method (of chapter 9). 

Answer: We may note the following: 

e The cost of the finite difference method scales up as the mesh is refined (i.e. its size increases) while 
the cost of the Rayleigh-Ritz method is independent of the mesh size since it is based on obtaining a 
closed form. However, the Rayleigh-Ritz method also “scales up” as higher approximations are pursued 
(although this scaling up is not related to the mesh size). 

e The finite difference method is essentially based on algebra while the Rayleigh-Ritz method is es- 
sentially based on calculus (integration). The advantage of algebra over calculus is its simplicity (and 
hence the general availability of solution) while its disadvantage is that it is normally very lengthy and 
messy. 

e Both methods have flexibility and inflexibility with certain types of boundary conditions. So, each 
method has its advantages and disadvantages in this regard. 


Chapter 10 
Hybrid Methods 


Hybrid methods are variational (and optimization) methods that are based on combining and mixing 
other (simpler) methods. In fact, the title of this chapter is very generic and hence it can include many 
methods which combine some of the previously-investigated methods (and possibly other methods). For 
example, we can combine the Rayleigh-Ritz method with a deterministic or stochastic computational 
technique to obtain a Rayleigh-Ritz numerical method (which is a hybrid method since it can be classified 
under chapter 8 and under chapter 9). In this case, the objective of the computational (numeric) method 
is to optimize the parameters (i.e. the parameters c;’s in the case of 1D and the parameters c;,’s in 
the case of 2D and similarly for higher dimensions) of the Rayleigh-Ritz method in one go using an 
optimization algorithm based for example on conjugate gradient or Nelder-Mead or quasi-Newton or 
simulated annealing methods.!!42!_ Anyway, the hybrid methods are generally very useful, flexible and 
practical and hence they represent an effective and powerful tool in the investigations and applications of 
the mathematics of variation in general (and the calculus of variations in particular). Moreover, they are 
usually associated with computer codes and packages and hence they are generally easy to use and easy 
to adapt and transform. Also, their cost is usually negligible because once the computer code or package 
is created or acquired it can be used infinite times with minimum time and effort (where the computers 
usually do all the hard work in the blink of an eye). 


Problems 


1. It is suggested in the text that the Rayleigh-Ritz method can be used in conjunction with a computer 

algorithm to obtain the parameters numerically. Suggest an alternative (hybrid) method in which the 
Rayleigh-Ritz method is used. 
Answer: For example, the analytical approach (which we explained and demonstrated in chapter 
8) can be automated (stage by stage) to obtain an automated analytical (or symbolic) Rayleigh-Ritz 
method (rather than numeric Rayleigh-Ritz method which we suggested above). However, being a 
hybrid method may be disputed unless the automation is based on a different approach for obtaining 
the parameters. 

2. Re-solve Problem 5 of § 8 using the y; Rayleigh-Ritz approximation and employing a numerical opti- 

mizer to obtain the parameters c),--- ,C5 (instead of differentiating J and solving the resulting equa- 
tions analytically). 
Answer: We solved this Problem following a similar method to that used in Problem 5 of § 8 but 
instead of differentiating J and solving the resulting equations analytically we passed the expression 
of J to a numerical optimizer (to optimize I for the parameters c,,--- ,cs5) and we obtained: c, ~ 
0.1680197269, co ~ —0.0522179496, cz ~ 0.0062854969, c4 ~ 0.0041982436 and cs ~ 0.0003880377. 
These values are very close to the values obtained in Problem 5 of § 8. In fact, these values produce 
identical results (up to the sixth decimal place of the values of ys) and identical plot. 

3. Re-solve Problem 9 of § 8 using the z33 Rayleigh-Ritz approximation and employing a numerical 
optimizer to obtain the parameters c11,--- ,¢33 (instead of differentiating J and solving the resulting 
equations analytically). 

Answer: We solved this Problem following a similar method to that used in Problem 9 of § 8 but 
instead of differentiating J and solving the resulting equations analytically we passed the expression of 


I to a numerical optimizer (to optimize I for the parameters ci1,--- ,C33) and we obtained: c11 = 4.0, 
Cg = —5.25, C13 = 5.25, cai = —5.25, C92 = 15.75, C23 = —15.75, C3, = 5.25, C32 = —15.75 and 
c33 = 15.75. These values are identical to the values obtained in Problem 9 of § 8 (noting the 


[142] Ty fact, I used some of these (or similar) methods in my past research in fluid mechanics. However, these methods are 
associated with computer codes and hence they are beyond the scope of this book. 
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correspondence between a, b,c, f,g,h, i,j,k and c11, C12, C13, C21, C22; C23, C31; C32, C33). 
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Absolute value, 6 
Acceleration, 4, 177-181, 186 
Action, 172-174 
integral, 172-174 
Algebra, 118, 152, 162, 198, 208, 211, 217, 236 
Angle, 73, 100, 104, 107, 109, 118, 120, 148, 161, 162, 164, 
171, 181 
Angular 
displacement, 89, 93 
momentum, 178, 179 
speed, 5, 175, 181 
Applied mathematics, 1 
Areal speed, 179 
Arithmetic, 8 
Azimuthal 
component, 175, 177 
coordinate, 69, 71, 73 


Base (of geometric shape), 113, 115, 117, 125, 127, 133-136, 
139, 140, 146-150 
Basis function, 5, 197-199, 202, 206, 208, 210 
Bead, 81, 82, 84-86, 88, 89, 91 
Beltrami identity, 10, 14, 70, 74, 94, 102, 155, 159 
Bernoulli, 81 
Boundary 
condition, 9, 11, 12, 19, 30, 33, 37, 41-44, 46-48, 51, 52, 
54, 55, 62, 94, 102, 128, 130, 159, 160, 191, 193, 
195, 197-199, 202, 206-208, 210, 211, 217, 219, 
221, 223, 225-227, 229, 231, 234, 236 
curve, 40, 41, 54, 55, 58, 59, 61, 62, 76-78, 80 
point, 60, 97, 107, 129, 130, 152, 155, 198, 199, 202, 
216, 218, 220, 221, 223, 225, 227, 228, 231, 232, 
236 
value problem, 198 
Brachistochrone, 81, 82, 84-86, 88, 89, 91, 93 


Cable, 93, 95 
Calculus, 1, 6, 8-10, 93, 95-98, 113, 116, 118, 132, 135, 138, 
141, 150, 161, 162, 236 
of variations, 1, 6-11, 13, 47, 49, 62, 81, 82, 93, 95, 97, 
98, 157, 161, 162, 172, 173, 189, 216, 237 
Cartesian 
coordinate system, 5, 9, 12, 66, 75, 82, 85, 93, 104, 118, 
121, 123, 141, 163, 164, 168-170, 178, 182 
coordinates, 66, 68, 73, 178, 186 
Catenary, 93-97, 128, 130, 166 
Catenoid, 41, 128, 130, 131 
Center of 
gravity, 93 
mass, 4, 185, 186 
Central 
angle, 100, 104, 107, 109 
field, 180 
force, 179, 180 
Centripetal acceleration, 178, 180 
Chain, 93-95 
rule, 14, 15, 17 
Chord (of circle), 107 
Circle, 6, 70, 71, 100, 104-112, 116, 118-125, 127, 149, 156, 
157, 171 


Circular 
arc, 70, 71, 100, 102, 104, 108, 110, 167, 170 
symmetry, 125 
Circumference, 106, 111, 115, 127 
Classical mechanics, 174 
Co-vertex (of ellipse), 126 
Complex number, 6 
Compound pendulum, 182 
Cone, 73-76, 135, 136, 138-141, 146-150 
Conjugate gradient, 237 
Conservation 
of angular momentum, 178, 179 
of energy, 82 
principles, 174 
Conservative 
force, 173, 177, 184, 186 
system, 174, 187 
Constraint, 4, 6, 9, 41, 47-49, 51-53, 63-65, 95, 98, 101, 102, 
105-107, 109, 113, 117, 118, 120, 132, 136-138, 
140, 141, 155, 157, 158, 189, 195, 207 
Continuous, 157, 158, 173, 187, 188, 216 
Continuum, 173, 174, 187 
mechanics, 173 
Coplanar, 80 
Cube, 6, 132, 133, 137, 138 
Cubic curve, 76, 153-155 
Cycloid, 84-86, 88, 89, 91 
Cylinder, 69-72, 75, 128, 137-139, 144-147 
Cylindrical 
coordinate system, 72, 175 
coordinates, 5, 9, 69, 70, 74-76, 129 


Density (or mass density), 5, 93-95, 187, 188 
Diameter, 127, 137, 139, 140, 147-149 
Differentiable, 6 

Differential 

equation, 1, 6, 10, 12, 48, 172, 189, 193 
operator, 4 

Differentiation, 1, 41, 74, 138, 151, 192 
Discrete, 4, 173, 187, 188 

Discretization, 216-219, 221, 223, 225, 227, 231 
Displacement, 180-182, 187 

Distance, 67, 68, 76-80, 125, 161, 181, 184 


Eccentricity (of ellipse), 4, 116 
Eigenfunction, 189, 193, 195 
Eigenvalue, 5, 189, 190, 192-196, 198 
Ellipse, 4, 115, 116, 118, 123-127 
Ellipsoid, 4, 140-142 
Engineering, 1, 197 
Enneper, 41 
Equatorial plane, 179 
Equilateral (triangle), 113, 115, 117, 118, 120, 121 
Equilibrium, 93, 95, 180, 181, 187 
Equipotential surface, 186, 187 
Euclidean 

plane, 66, 67, 76, 77, 79, 80 

space, 4, 5, 40, 41, 66, 68, 161, 163, 186 
Euler-Lagrange equation, 8-16, 19 
Euler-Ostrogradsky equation, 38 
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Euler-Poisson equation, 34 
Explicit dependency, 10-12, 14, 16, 29, 38, 41, 70, 72, 74, 75, 
83, 93, 177 
Extremal 
curve, 10, 11, 55-57, 59-62 
function, 11 
point, 10 
Extremizing function, 5, 23-30, 34, 37, 38, 41-44, 46, 172, 
193, 196-198, 217 


Fastest descent curve, 81, 82, 85, 86, 88, 91-93 
Fermat’s principle, 161, 163, 165 
Finite difference method, 5, 216-218, 220, 221, 223, 225, 227, 
228, 231, 232, 236 
First integral, 10, 172 
Fluid 
dynamics, 173 
mechanics, 237 
Focus (of ellipse), 125 
Force, 5, 86, 95, 172-174, 177-182, 184, 186, 187 
Free 
boundary, 54 
fall, 86, 92, 93 
movement, 54 
particle, 177, 178 
system, 184 
Frequency, 180, 182 
Friction, 174, 187 
Frictionlessly, 81, 82, 85 
Function of function, 7, 8 
Functional, 1, 4, 6-13 
integral, 4, 8, 10, 11, 19, 23, 30, 34, 35, 37-46, 48, 49, 
63, 64, 83, 159, 172, 188, 191, 197, 199-203, 205, 
207, 208, 210, 213, 216 


Galileo, 81 
Generator, 70, 71, 140, 144, 147 
Geodesic, 66-73, 75-80, 161, 186, 187 
Geometrical optics, 161 
Geometry, 107, 125, 162 
Gradient, 4, 177, 186 
operator, 186 
Gravitational 
acceleration, 4, 181 
constant, 179 
field, 81, 82, 93-95, 178-181 
force, 4, 179, 180 
potential, 5, 178, 179 
Gravity, 81, 82, 85, 86, 88, 93, 174, 180-182, 184, 187 
Great circle, 73, 76 


Hamilton (William), 172 
Hamilton’s principle of least action, 172, 173 
Hamiltonian 
formulation, 172, 173, 177-182, 184, 187 
mechanics, 4, 172-175 
Height (of geometric shape), 4, 113-115, 117, 118, 125, 127, 
133, 135-137, 139-141, 144, 146-149 
Helical arc, 72 
Helicoid, 41 
Helix, 70-72 
Homogeneous (optically), 161, 163, 165 
Hooke’s law, 188 
Hookean, 180 
Hybrid methods, 237 


Hyperbolic 
cosine, 93-95, 97, 128, 166 
function, 32, 33 


Imaginary number, 6 

Implicit dependency, 12, 14, 16, 38, 41 

Inertial force, 180 

Infinitesimal, 4, 67, 93, 104, 128, 157, 216 

Inflection, 7, 11, 48, 153, 161 

Inscribed, 118-126, 141-150 

Integrand, 4, 8, 10-12, 16, 48, 49, 172, 199-203 

Integration, 1, 10, 12, 48, 76, 83, 152, 172, 236 
by parts, 191, 192 

Interpolation, 217 

Inverted Cartesian coordinate system, 82, 85, 182 

Isochrone, 82, 89, 91, 93 

Isoperimetric, 98 

Isosceles (triangle), 113, 118, 125, 127 

Isotropic (optically), 165 


Kepler’s second law, 179 
Kinetic energy, 4, 82, 173-178, 180, 181, 183, 184, 187 


Lagrange 
multiplier (or undetermined multiplier), 5, 47-49, 190 
multipliers technique (or method), 4, 47-49, 101, 105, 
106, 111, 113, 114, 116, 117, 132, 133, 136, 137, 
139-141, 155, 157 
Lagrange’s equations, 172, 174 
Lagrangian, 4, 49, 172-177, 179, 181, 183-186, 188 
equations, 172, 177 
function, 172 
mechanics, 172 
Laplace equation, 40 
Law of light 
reflection, 161—163 
refraction, 163, 164 
Line element, 4, 66, 67, 69, 70, 72—75, 168 
Linear mass density, 5, 93-95, 187, 188 
Logarithmic function, 6 


Mapping, 9 

Mass, 4, 82, 93, 174, 177-182, 184-186 
Massive particle, 81, 184 

Massless, 180 

Mathematics, 1, 7, 8, 81, 93, 98, 173 
of variation, 1, 6, 7, 13, 47, 66, 113, 132, 172, 237 
Mechanical system, 172-180, 184 
Mechanics, 9, 172-174 

Meridian, 75, 76 

Minimal surface, 41, 129-131 

Mixed nature, 63, 65 

Moment of inertia, 4, 174 
Multi-variable 

differential calculus, 1, 7 
function, 5, 49 

integral calculus, 1, 7 


nabla operator, 4 

Natural 
boundary conditions, 62 
logarithm, 1, 24, 79 
parameter, 98 

Nelder-Mead, 237 

Newton’s 
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first law, 186 

laws, 173, 174 

second law, 177-180, 186 
Newtonian mechanics, 173 
Non- 

conservative force, 187 

Euclidean, 161 

negative, 1, 6 

oscillating, 187 
Numerical 

methods, 216 

optimizer, 237 


Octant, 141 
Optimal 
curve, 66, 98 
solid, 132, 152, 156 
surface, 113, 127 
Origin of coordinates, 73, 74, 82, 83, 89, 91, 119, 122, 124, 
126, 141, 155, 184 
Orthonormal Cartesian coordinate system, 5, 66, 75 
Oscillation, 187, 188 
Overdot, 4, 16, 70, 172 


Parabola, 77, 79, 150-153 
Parallelepiped, 6, 132, 133, 136-138, 141-143 
Partial 
derivative, 5, 9, 12, 15, 38, 49, 54, 64, 113, 114, 117, 
119, 120, 126, 132, 133, 136, 137, 139-141, 188, 
193, 216, 218-220, 222, 224, 226, 230, 235 
differentiation, 1, 12, 41 
Particle, 81, 173, 174, 177, 178, 180-182, 184, 186, 187 
Pendulum, 181, 182, 184 
Perimeter, 4, 6, 49, 98, 107, 109, 111, 113-118, 120-128, 157, 
171 
Physics, 1, 7, 11, 93, 95, 173, 174 
Planar 
curve, 81, 98, 99, 101, 102, 105, 106, 127, 128 
shape, 98, 113, 116, 118 
Plane, 5, 40, 41, 66, 67, 69, 73, 76, 77, 79-82, 84, 91, 93, 104, 
106, 109, 111, 125, 127, 157, 163-170, 174, 177, 
179, 181, 182, 184, 187, 206 
curve, 40, 104, 106, 109, 111, 125, 127, 157 
Poisson equation, 39, 40 
Polar 
angle, 171 
coordinate system, 167, 168 
coordinates, 5, 9, 67, 167, 168, 170, 177 
plot, 170, 171 
Polyhedron, 132 
Polynomial function, 33, 197-199, 202, 206, 208, 210, 217 
Position vector, 4, 177, 184, 185 
Potential energy, 4, 82, 93, 95, 173-177, 180-182, 184, 186, 
188 
Prime, 4, 9, 23, 26, 34, 44, 66-68, 75, 95, 98 
Principle of 
least action, 172-174 
least time, 161, 163 
Reciprocity, 111 
stationary action, 174 
Product rule, 14, 17, 18 
Profile curve, 75, 127, 129, 130, 152, 160 
Proof by contradiction, 111, 117, 137-139, 158 
Pyramid, 133-136 
Pythagoras theorem, 77, 78, 115, 144, 162, 163 


Quadrant, 122-124 

Quadratic formula, 134, 143, 145 
Quartic equation, 77 
Quasi-Newton, 237 


Radial 
acceleration, 179 
component, 175, 177 
direction, 178 
Radius, 4, 69, 70, 72, 75, 91, 100-102, 105-108, 111, 118, 
120, 121, 123-125, 127, 128, 136, 137, 139, 140, 
144-150, 156-158, 175 
Rayleigh-Ritz 
approximation, 5, 200, 201, 204-206, 214, 237 
method, 4, 5, 197-202, 204—206, 208-212, 214, 215, 236, 
237 
Real number, 4, 6, 8, 9, 52, 134 
Real-valued function, 1, 6 
Rectangle, 6, 49, 104, 111, 113, 116, 117, 121-125 
Rectangular 
Cartesian coordinate system, 141 
parallelepiped, 132, 136-138, 141, 142 
Reduced mass, 4, 186 
Reflection (of light), 161-163 
Refraction (of light), 163, 164 
Refractive index, 4, 163-170 
Regular pyramid, 133-135 
Resistance (to fluid flow), 4, 158-160 
Right circular 
cone, 73-76, 136, 138, 140, 146-150 
cylinder, 69-72, 75, 137-139, 144-147 
Rotational kinetic energy, 4, 175 
Rules of 
differentiation, 192 
variation, 192 


Saddle, 7, 48, 161 
Scalar, 4, 174 
Science, 174, 197 
Second order 
derivative, 34, 63, 64 
differential equation, 12, 48, 172 
Sector (of circle), 104 
Segment (of circle), 100, 107, 108, 120, 121 
Semi- 
axes (of ellipsoid), 4, 140, 141, 143 
circle, 100, 101 
disc, 100 
major axis (of ellipse), 4, 116, 123-126 
minor axis (of ellipse), 4, 123-126 
SI units, 92 
Side (of polygon or polyhedron), 49, 117, 118, 125-127, 135 
Simple 
harmonic motion, 180 
pendulum, 181 
Simulated annealing, 237 
Simultaneous equations, 49, 154, 202, 204, 205, 207, 210, 211, 
214, 217-219, 221, 222, 224, 227, 231, 236 
Single-variable 
differential calculus, 1, 7 
function, 49 
integral calculus, 1, 7 
Sinusoidal function, 32, 33, 180, 182, 197 
Slant height, 140, 141 
Snell’s law, 163, 164 
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Solid of revolution, 6, 150, 152, 155-160 notation, 172 


Speed, 5, 82, 84, 89, 161, 163, 175, 180 Velocity, 5, 172, 177, 179, 184 
of light, 4, 161, 163, 165 vector, 177 
of light in material media, 4, 163, 165 Vertex, 104, 118-127, 141 
of light in vacuum, 4, 163 
of wave, 5, 188 Wave equation, 188 

Sphere, 6, 72, 73, 76, 140, 144-150, 156-158 Wavelength, 161 

Spherical Width, 4, 113, 116, 122-125, 128, 157 
coordinate system, 72, 179, 184 Wire, 82, 85, 88 
coordinates, 4, 72, 73, 75, 179, 184 Work-energy principle, 82, 84 
pendulum, 184 
symmetry, 76 Yo-yo, 174, 175 

Spheroid, 142, 143 

Spline, 217 

Spring, 180 


constant, 180 
Square, 6, 49, 113, 116, 117, 122, 123, 125, 133-135 
brackets, 9, 216 
division, 225, 227, 231 
root, 1, 6, 68 
Stationarization, 4, 8-10, 12, 48, 71, 192, 193, 197, 216 
Stationarizing function, 23, 30, 37, 41-43, 46, 193, 196 
Stationary, 7-10, 14, 47, 48, 151, 154, 161, 162, 164, 172-174, 
192, 193, 196 
String, 174, 175, 181, 182, 184, 187, 188 
Sturm-Liouville 
equation, 189-193 
problem, 5, 189, 190, 192, 193, 195, 196, 198 
Sufficiently smooth, 6 
Sum of sides lengths (of polyhedron), 132, 133, 135, 137, 138 
Surface of revolution, 6, 75, 76, 127-130, 157 
Symmetry, 76, 107, 122, 123, 125, 129, 141, 142, 158, 163 


Tautochrone, 82, 89, 91, 93 
Tension, 5, 95, 187, 188 
Total 
derivative, 4, 11, 12, 19, 21, 22, 28, 29, 38 
energy, 177 
Translational 
kinetic energy, 4, 175 
speed, 175 
Transversality condition, 54-58, 60-62, 65, 76, 77, 79, 80, 86 
Triangle, 104, 113-121, 125-127, 146, 148 
Trigonometric 
function, 91 
identity, 68, 73, 90 


Uniform 
gravitational field, 81, 94 
gravity, 93, 181, 182, 184 
linear density, 93, 94, 188 
mass density, 187 
medium, 161 
motion, 89, 158 

Unit 
circle, 118, 119, 122, 125 
sphere, 145, 146, 150 


Variable boundary, 53-55, 57-59, 61, 62, 65, 76, 77, 79, 80, 
85 
Variational 
calculus, 7, 12, 82, 129, 193 
principle, 7, 8, 10, 13, 14, 161, 162, 164, 172 
Vector, 4, 163, 172, 174, 177, 184, 185 
equations, 174 
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